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METHOD OF PRODUCING PLANTS HAVING ENHANCED TRANSPIRATION 
EFFICIENCY AND PLANTS PRODUCED THEREFROM 
FIELD OF THE INVENTION 

The present invention relates to the field of plant breeding and the production of 
5 genetically engineered plants. More specifically, the invention described herein 
provides genes that are capable of enhancing the transpiration efficiency of a plant 
when expressed therein. These genes are particularly useful for the production of 
plants having enhanced transpiration efficiency, by both traditional plant breeding and 
genetic engineering approaches. The invention further extends to plants produced by 
10 the methods described herein. 

BACKGROUND TO THE INVENTION 
1. General 

This specification contains nucleotide and amino acid sequence information prepared 
using Patentln Version 3.1, presented herein after the claims. Each nucleotide 

15 sequence is identified in the sequence listing by the numeric indicator <210> followed 
by the sequence identifier (e.g. <210>1, <210>2, etc). The length and type of sequence 
(DNA, protein (PRT), etc), and source organism for each nucleotide sequence, are 
indicated by information provided in the numeric indicator fields <211>, <212> and 
<213>, respectively. Nucleotide sequences referred to in the specification are defined 

20 by the term "SEQ ID NO:", followed by the sequence identifier (eg. SEQ ID NO: 1 
refers to the sequence in the sequence listing designated as <400>1). 

The designation of nucleotide residues referred to herein are those recommended by the 
IUPAC-IUB Biochemical Nomenclature Commission, wherein A represents Adenine, 

25 C represents Cytosine, G represents Guanine, T represents thymine, Y represents a 
pyrimidine residue, R represents a purine residue, M represents Adenine or Cytosine, K 
represents Guanine or Thymine, S represents Guanine or Cytosine, W represents 
Adenine or Thymine, H represents a nucleotide other than Guanine, B represents a 
nucleotide other than Adenine, V represents a nucleotide other than Thymine, D 

30 represents a nucleotide other than Cytosine and N represents any nucleotide residue. 
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As used herein the term "derived from" shall be taken to indicate that a specified 
integer is obtained from a particular source albeit not necessarily directly from that 
source. 

5 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated step or element or integer or group of steps or elements 
or integers but not the exclusion of any other step or element or integer or group of 
10 elements or integers. 

Those skilled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
15 invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specification, individually or collectively, and any and all 
combinations or any two or more of said steps or features. 

The present invention is not to be limited in scope by the specific embodiments 
20 described herein, which are intended for the purposes of exemplification only. 
Functionally equivalent products, compositions and methods are clearly within the 
scope of the invention, as described herein. 

2. Description of the related art 

It is well known that virtually all plants require a certain quantity of water for proper 
growth and development, because CO2 fixation and photosynthate assimilation by 
25 plants cost water. A significant quantity of water absorbed by plants from the soil 
returns to the atmosphere via plant transpiration. 
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Transpiration efficiency is a measure of the amount of dry matter produced by a plant 
per unit of water transpired, or, in other words, carbon gain relative to water lost 
through transpiration. 

5 For plants having low transpiration efficiency, or when water is in short supply, the loss 
of water through transpiration can limit key metabolic processes associated with plant 
growth and development. For example, during drought, or when plants having low 
transpiration efficiency are grown in arid and semi-arid environments, plant 
productivity as determined by dry matter production or photosynthetic rate, is 
10 considerably reduced. Accordingly, the production of plants having enhanced water 
use efficiency or transpiration efficiency is highly desirable for their adaptation to arid 
or semi-arid conditions, or to enhance their drought resistance. 

The enhancement of water use efficiency or transpiration efficiency by plants is also 
15 highly desirable in consideration of global climatic change and increasing pressure on 
world water resources. The inefficient utilization of agricultural water is known to 
impact adversely upon the supply of navigable water, potable water, and water for 
industrial or recreational use. Accordingly, the production of plants having enhanced 
transpiration efficiency is highly desirable for reducing the pressure on these water 
20 resources. It is also desirable for increasing plant productivity under well-watered 
conditions. 

By enhancing transpiration efficiency, carbon gain rates are enhanced per unit of water 
transpired, thereby stimulating plant growth under well-watered conditions, or 
25 alternatively, under mild or severe drought conditions. This is achieved by enhancing 
carbon gain more than transpiration rate, or by reducing the amount of water lost at any 
particular rate of carbon fixation. Those skilled in the art also consider that for a given 
growth rate plants having enhanced transpiration efficiency dry out soils more slowly, 
and use less water, than less efficient near-isogenic plants. 

30 
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Several chemical as well as environmental pre-treatments have been described for 
enhancing the ability of plant seedlings to survive drought, either by reducing 
transpiration or by reducing the amount of water that is actually lost to the atmosphere. 

5 Known environmental treatments largely involve the use of physical barriers. Whilst 
placing a physical barrier over plant stomata is known to reduce water loss via 
transpiration, the procedure is not always desirable or practicable for field-grown crops. 
For example, physical barriers over plant stomata may inhibit certain gas-exchange 
processes of the plant. It is more desirable to enhance actual transpiration efficiency or 
10 water use efficiency of the plant through manipulation of intrinsic plant function. 

Chemical agents are typically the so-called "anti-transpirant" or "anti-desiccant" agents, 
both of which are applied to the leaves. Anti-transpirants are typically films or 
metabolic anti-transpirants. 

15 

These products form a film on leaves, thereby either blocking stomatal pores, or 
coating leaf epidermal cells with a water-proof film. Typical film anti-transpirants 
include waxes, wax-oil emulsions, higher alcohols, silicones, plastics, latexes and 
resins. For example, Elmore, United States Patent No. 4,645,682 disclosed an anti- 
20 transplant consisting of an aqueous paste wax; Cushman et al, United States Patent 
Nos. 3,791,839 and 3,847,641 also disclosed wax emulsions for controlling 
transpiration in plants; and Petrucco et al y United States Patent No. 3,826,671, 
disclosed a polymer composition said to be effective for controlling transpiration in 
plants. 

25 

Metabolic anti-transpirants generally close stomata, thereby reducing the rate of 
transpiration. Typical metabolic anti-transpirants include succinic acids, 
phenylmercuric acetate, hydroxysulfonates, the herbicide atrazine, sodium azide, and 
phenylhydrazones, as well as carbon cyanide. 

30 
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Compounds having plant growth regulator activity have also been shown to be useful 
for reducing transpiration. For example, Bliesner et al t United States Patent No. 
4,671,816, disclosed an acetylene compound, said to possess utility for regulating plant 
growth, whilst Kuznetsov et al. (Russian Patent No. SU 1,282,492; ., Russian Patent 

5 Application No. SU 1,253,559-A1), and Smirnov et al (Russian Patent No. SU 
1,098,934) disclosed the use of derivatives of 2-methyl-5-hydroxybenzimidazole, and 
the chloride or bromide salts thereof as anti-transpirant growth regulators. 
Vichnevetskaia (USSN 5,589,437 issued December 31, 1996) also describe 
hydroxybenzimidazole derivatives for enhancing the drought resistance of plants by 

10 reducing transpiration. Schulz et al, United States Patent No. 4,943,3 15, also disclosed 
formulations comprising an acetylene and a phenylbenzylurea compound, for reducing 
transpiration in plants and/or for avoiding impairment to plants caused by heat and dry 
conditions. Abscisic acid has also been shown to reduce or suppress transpiration in 
plants (eg. Helv. Chim. Acta, 71, 931, 1988; J, Org, Chem., 54, 681, 1989; and 

15 Japanese Patent Publication No. 184,966/1991). 

Metabolic anti-transpirants are costly to produce and often exhibit phytotoxic effects or 
inhibit plant growth (Kozlowski (1979), In: Tree Growth and Environmental Stresses 
(Univ. of Washington Press, Seattle and London)), and are not practically used. 

20 

Recent studies have examined alternative methods for enhancing transpiration 
efficiency, particularly breeding approaches to select lines that grow more efficiently 
under mild drought conditions. Carbon isotope discrimination has been used to identify 
Arabidopsis ecotypes with contrasted transpiration efficiencies (Masle et al, In: Stable 
25 isotopes and plant carbon-water relations, Acad. Press, Physiol. Ser., pp371-386, 1993) 
and to assist conventional breeding of new plant varieties in a number of species (Hall 
et al, Plant Breeding Reviews 4, 81-113, 1994) including rice (Farquhar et al, In: 
Breaking the Yield Barrier, ed KG Cassman, IRRI, 95, 101) and most recently wheat 
(Rebetzke et al. Crop Science 42:739-745, 2002). 

30 
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No single gene has been identified as being capable of enhancing transpiration 
efficiency when expressed in planta. Transpiration efficiency may well be multigenic. 
As a consequence, the genes and signalling pathways that regulate the photosynthetic 
and/or stomatal components of the transpiration efficiency mechanism in plants have 
5 not been identified or characterized. 

Moreover, notwithstanding that the effect of down-regulating expression of the Rubisco 
gene, or mutation in genes involved in abscisic acid (eg. aba, abi\ are known to modify 
transpiration efficiency to some extent through stomatal closure, the consequence of 
10 such modifications is not necessarily specific, resulting in pleiotropic effects. 

Arabidopsis thaliana ecotypie Landsberg erecta (L-erl) is one of the most popular 
ecotypes and is used widely for both molecular and genetic studies. It harbors the erl 
mutation, which confers a compact inflorescence, blunt fruits, and short petioles. There 

15 are a number of erecta mutant alleles. Phenotypic characterization of the mutant alleles 
suggests a role for the wild type ER gene in regulating plant morphogenesis, 
particularly the shapes of organs that originate from the shoot apical meristem. Torii et 
al t The Plant Cell 8, 735, 1996, showed that the ER gene encodes a putative receptor 
protein kinase comprising a cytoplasmic protein kinase catalytic domain, a 

20 transmembrane region, and an extracellular domain consisting of leucine-rich repeats, 
which are thought to interact with other macromolecules. 

SUMMARY OF THE INVENTION 

In work leading up to the present invention, the inventors sought to elucidate the 
25 specific genetic determinants of plant transpiration efficiency. In plants, the 
development of molecular genetic markers, such as, for example, genetic markers that 
map to a region of the genome of a crop plant, such as, for example, a region of the rice 
genome, maize genome, barley genome, sorghum genome, or wheat genome, or a 
region of the tomato genome or of any Brassicaceae, assists in the production of plants 
30 having enhanced transpiration efficiency (Edwards et aL, Genetics 116, 113 -125, 
1987; Paterson etaL, Nature 335, 721-726, 1988). 
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The present inventors identified a locus that is linked to the genetic variation in 
transpiration efficiency in plants. To elucidate a locus associated with the transpiration 
efficiency of plants, the inventors established experimental conditions and sampling 
procedures to determine the contribution to total transpiration efficiency of the factors 
5 influencing this phenotype, and, more particularly, the genetic contribution to the total 
variation in transpiration efficiency. Factors influencing transpiration efficiency 
include, for example, genotype of the plant, environment (eg. temperature, light, 
humidity, boundary layer around the leaves, root growth conditions), development (eg. 
age and/or stage and/or posture of plants that modify gas exchange and/or carbon 

10 metabolism), and seed-specific factors (Masle et al. 1993, op. cit). The screens 
developed by the inventors were also used to survey mutant and wild type populations 
for variations in transpiration efficiency and to identify ecotypes having contrasting 
transpiration efficiencies including the parental lines that had been used by Lister and 
Dean (1993). The transpiration efficiencies of the members of Lister and Dean's 

15 (1993) Recombinant Inbred Line (RIL) mapping population were then determined, and 
linkage analyses were performed against genetic markers to determine the chromosome 
regions that are linked to genetic variation in transpiration efficiency, thereby 
identifying a locus conditioning transpiration efficiency. Complementation tests, 
wherein plants were transformed with a wild-type allele at this locus confirmed the 

20 functionality of the allele in determining a transpiration efficiency phenotype. 

In one exemplified embodiment of the invention, there is provided a locus associated 
with transpiration efficiency of A. thaliana, such as, for example the ERECTA locus on 
A. thaliana chromosome 2, or a hybridization probe which maps to the region between 

25 about 46cM and about 50.7cM on chromosome 2 of A. thaliana. In further exemplified 
embodiments, the inventors identified additional ERECTA alleles or erecta alleles in A. 
thaliana, rice, sorghum, wheat and maize which are structurally related to this primary 
A. thaliana ERECTA or erecta allele. Based upon the large number of ERECTA/erecta 
alleles described herein, the present invention clearly extends to any homologs of the A. 

30 thaliana ERECTA locus from other plant species to those specifically exemplified, and 
particularly when those homologs are identified using the methods described herein. 
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Accordingly, one aspect of the invention provides a genetic marker or locus associated 
with the genetic variation in transpiration efficiency of a plant, wherein said locus 
comprises a nucleotide sequence linked genetically to an ERECTA locus in the genome 
5 of the plant. The locus or genetic marker is useful for determining transpiration 
efficiency of a plant. 

As used herein, the terms "genetically linked" and "map to" shall be taken to refer to a 
sufficient genetic proximity between a linked nucleic acid comprising a gene, allele, 

10 marker or other nucleotide sequence and nucleic acid comprising all or part of an 
ERECTA locus to permit said linked nucleic acid to be useful for determining the 
presence of a particular allele of said ERECTA locus in the genome of a plant. Those 
skilled in the art will be aware that for such linked nucleic acid to be used in this 
manner, it must be sufficiently close to said locus not to be in linkage disequilibrium or 

15 to have a high recombination frequency between said linked nucleic acid and said 
locus. Preferably, the linked nucleic acid and the locus are less than about 25cM apart, 
more preferably less than about lOcM apart, even more preferably less than about 5cM 
apart, still even more preferably less than about 3cM apart and still even more 
preferably less than about lcM apart. 



In a preferred embodiment the present invention provides an isolated nucleic acid 
associated with the genetic variation in transpiration efficiency of a plant, said nucleic 
acid comprising a nucleotide sequence selected from the group consisting of: 
(a) the sequence of an ERECTA genomic gene or the 5'-UTR or 3 '-UTR or protein- 



(b) the sequence of an allelic variant of (a) or the 5'-UTR or 3'-UTR or protein- 
encoding region or an intron region of said allelic variant; 

(c) the sequence of a fragment of (a) or (b) that hybridizes specifically to nucleic 
acid (eg., RNA or DNA) from a plant under at least low stringency hybridization 



20 



25 



encoding region or an intron region thereof; 



30 



(d) 



conditions; and 

a sequence that is complementary to (a) or (b) or (c). 
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In a particularly preferred embodiment, the present invention provides an isolated 
ERECTA gene from wheat comprising a nucleotide sequence selected from the group 
consisting of: 
5 (i) the sequence set forth in SEQ ID NO: 19; 

(ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 20; and 

(iii) a sequence that is complementary to (i) or (ii). 

In an alternative embodiment, the present invention provides an isolated ERECTA gene 
10 from maize comprising a nucleotide sequence selected from the group consisting of: 

(i) the sequence set forth in SEQ ID NO: 44; 

(ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 45; and 

(iii) a sequence that is complementary to (i) or (ii). 

15 In another alternative embodiment, the present invention provides an isolated ERECTA 
gene from rice comprising a nucleotide sequence selected from the group consisting of: 

(i) the sequence set forth in SEQ ID NO: 3; 

(ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 4; and 

(iii) a sequence that is complementary to (i) or (ii). 

20 

In another alternative embodiment, the present invention provides an isolated ERECTA 
gene from A. thatliana comprising a nucleotide sequence selected from the group 
consisting of: 

(i) the sequence set forth in SEQ ID NO: 1; 
25 (ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 2; and 

(iv) a sequence that is complementary to (i) or (ii). 

In another alternative embodiment, the present invention provides an isolated ERECTA 
gene from A. thatliana comprising a nucleotide sequence selected from the group 
30 consisting of: 

(i) the sequence set forth in SEQ ID NO: 7; 
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(ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 8; and 

(v) a sequence that is complementary to (i) or (ii). 

In another alternative embodiment, the present invention provides an isolated ERECTA 
5 gene from A. thatliana comprising a nucleotide sequence selected from the group 
consisting of: 

(i) the sequence set forth in SEQ ID NO: 9; 

(ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 10; and 

(vi) a sequence that is complementary to (i) or (ii). 

10 

In yet another alternative embodiment, the present invention provides an isolated 
ERECTA gene from sorghum comprising a nucleotide sequence selected from the 
group consisting of: 

(i) the sequence set forth in SEQ ID NO: 5; 
15 (ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 6; and 

(vii) a sequence that is complementary to (i) or (ii). 

Notwithstanding that an ERECTA or erecta structural gene or genomic gene or the 
protein encoding region thereof is particularly useful for breeding and/or mapping 
purposes, this aspect of the present invention is not to be limited to the ERECTA or 
erecta structural or genomic gene or the protein-encoding region thereof. As 
exemplified herein, the primary A, thaliana ERECTA locus can be determined using 
any linked nucleic acid that maps to a region in the chromosome at a genetic distance 
of up to about 3cM from the ERECTA or erecta allele. The skilled artisan will readily 
be able to utilize similar probes to identify linkage to an ERECTA or erecta allele in 
any other plant species, based upon the teaching provided herein that the ERECTA or 
erecta allele is linked to the transpiration efficiency phenotype of plants. 

Preferably, all or part of the locus associated with the transpiration efficiency 
30 phenotype in a plant (ie., nucleic acid genetically linked to the ERECTA or erecta 
structural or genomic gene) is provided as recombinant or isolated nucleic acid, such 



20 



25 
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as, for example, in the form of a gene construct (eg. a recombinant plasmid or cosmid), 
to facilitate germplasm screening. 

The ERECTA locus or a gene that is linked to the ERECTA locus is particularly useful 
5 in a breeding program, to predict the transpiration efficiency of a plant, or alternatively, 
as a selective breeding marker to select plants having enhanced transpiration efficiency. 
Once mapped, marker-assisted selection (MAS) is used to introduce the ERECTA locus 
or markers linked thereto into a wide variety of populations. MAS has the advantage of 
reducing the breeding population size required, and the need for continuous recurrent 
10 testing of progeny, and the time required to develop a superior line. 

Accordingly, a further aspect of the present invention provides a method of selecting a 
plant having enhanced transpiration efficiency, comprising detecting a genetic marker 
for transpiration efficiency which marker comprises a nucleotide sequence linked 
15 genetically to an ERECTA locus in the genome of the plant and selecting a plant that 
comprises or expresses the genetic marker, preferably wherein the genetic marker 
comprises an ERECTA allele or erecta allele, or a protein-encoding portion thereof, or 
alternatively, wherein the genetic marker comprises a nucleotide sequence having at 
least about 55% overall sequence identity to at least about 20 nucleotides in length of 
20 any one of SEQ ID Nos: 1, 3, 5, 7, 9, 1 1 to 19 or 21 to 44 or a complementary sequence 
thereto, including a nucleotide sequence selected from the group consisting of: 
(a) a sequence having at least about 55% identity to a sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 
7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
25 NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 

SEQ ID NO: 19 SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 
NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 
30 SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID 

NO: 42, SEQ ID NO: 43, SEQ ID NO: 44; 
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(b) a sequence encoding an amino acid sequence having at least about 55% identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 20 and SEQ ID NO: 45; and 
5 (c) a sequence complementary to (a) or (b). 

In an alternative embodiment, the invention provides a method of selecting a plant 
having enhanced transpiration efficiency, comprising: 

(a) screening mutant or near-isogenic or recombinant inbred lines of plants to 
10 segregate alleles at an ERECTA locus; 

(b) identifying a polymorphic marker linked to said ERECTA locus; and 

(c) selecting a plant that comprises or expresses the marker. 

The data exemplified herein for A. thaliana or rice can clearly be extrapolated to other 
15 plant species. For example, the evidence provided herein for the role of the A, thaliana 
ERECTA allele in determining the transpiration efficiency phenotype in those plant 
species has permitted the elucidation of a wide range of homologous ERECTA alleles in 
other plant species, in particular wheat, rice, sorghum and maize, that are also likely to 
determine the transpiration efficiency phenotype in those plants. In accordance with 
20 this embodiment, the present invention provides a method of selecting a plant having 
enhanced transpiration efficiency, comprising selecting a plant that comprises or 
expresses a functionally equivalent homolog of a protein-encoding region of the 
ERECTA gene of A. thaliana, maize, wheat, sorghum or rice. 

25 In a preferred embodiment, the invention provides a method of selecting a plant having 
enhanced transpiration efficiency, comprising: 

(a) identifying a locus on the Arabidopsis chromosome 2 (46-50.7 cM) or rice 
chromosome 6 associated with genetic variation in transpiration efficiency in a 
plant; 
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(b) identifying nucleic acid in a different plant species that comprises a nucleotide 
sequence having at least about 55% identity to the sequence of the locus at (a); 
and 

(c) selecting a plant that comprises or expresses the identified nucleic acid at (b). 

5 

In a further preferred embodiment, this aspect of the invention provides a method of 
selecting a plant having enhanced transpiration efficiency, comprising: 

(a) identifying a locus on the Arabidopsis chromosome 2 (46-50.7 cM) or rice 
chromosome 6 associated with genetic variation in transpiration efficiency in a 

10 plant; 

(b) determining the nucleotide sequence of the identified locus; 

(c) identifying nucleic acid of a plant species other than A. thaliana or rice that 
comprises a nucleotide sequence having at least about 55% identity to the 
sequence of the locus at (a); and 

15 (d) selecting a plant that comprises or expresses the identified nucleic acid at (b). 

Preferably, the selected plant according to any one or more of the preceding 
embodiments is Arabidopsis thaliana, rice, sorghum, wheat or maize, however other 
species are not excluded. 

20 

Preferably, the subject selection method comprises linking the transpiration efficiency 
phenotype of the plant to the expression of the marker in the plant, or alternatively, 
linking a structural polymorphism in DNA to a transpiration efficiency phenotype in 
the plant, eg., by a process comprising detecting a restriction fragment length 

25 polymorphism (RFLP), amplified fragment length polymorphism (AFLP), single strand 
chain polymorphism (SSCP) or microsatellite analysis. As will be known to the skilled 
artisan, a nucleic acid probe or primer of at least about 20 nucleotides in length from 
any one of SEQ ID Nos: 1, 3, 5, 7, 9, 11 to 19 or 21 to 44 or a complementary sequence 
thereto can be hybridized to genomic DNA from the plant, and the hybridization 

30 detected using a detection means, thereby identifying the polymorphism. 
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It is clearly preferred that the selected plant has enhanced transpiration efficiency 
compared to a near-isogenic plant that does not comprise or express the genetic marker. 

As exemplified herein, the inventors also identified specific genes or alleles that are 
5 linked to the ERECTA locus of A. thaliana, and rice and determined the transpiration 
efficiencies of those plants. More particularly, the transpiration efficiencies of near- 
isogenic lines, each carrying a mutation within an ERECTA locus, and a correlation 
between transpiration efficiency phenotype and ERECTA expression or gene copy 
number are determined, thereby providing the genetic contribution of genes or alleles at 

10 the ERECTA locus to transpiration efficiency. This analysis permits an assessment of 
the genetic contribution of particular alleles to transpiration efficiency, thereby 
determining allelic variants that are linked to a particular transpiration efficiency. 
Thus, the elucidation of the ERECTA locus for transpiration efficiency in plants 
facilitates the fine mapping and determination of allelic variants that modulate 

15 transpiration efficiency. The methods described herein can be applied to an assessment 
of the contribution of specific alleles to the transpiration efficiency phenotype for any 
plant species that is amenable to mutagenesis such as, for example, by transposon 
mutagenesis, irradiation, or chemical means. As will be known to the skilled artisan 
many crop species, such as, maize, wheat, and rice, are amenable to such mutagenesis. 

20 

Accordingly, a third aspect of the invention provides a method of identifying a gene 
that determines the transpiration efficiency of a plant comprising: 
(a) identifying a locus associated with genetic variation in transpiration efficiency in 
a plant; 

25 (b) identifying a gene or allele that is linked to said locus, wherein said gene or 
allele is a candidate gene or allele for determining the transpiration efficiency of 
a plant; and 

(c) determining the transpiration efficiencies of a panel of plants, wherein not all 
members of said panel comprise or express said gene or allele, and wherein 
30 variation in transpiration efficiency between the members of said panel indicates 

that said gene is involved in determining transpiration efficiency. 
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In another embodiment, the method comprises: 

(a) identifying a locus associated with genetic variation in transpiration efficiency in 
a plant; 

5 (b) identifying multiple alleles of a gene that is linked to said locus, wherein said 
gene is a candidate gene involved for determining the transpiration efficiency of 
a plant; and 

(c) determining the transpiration efficiencies of a panel of plants, wherein each 
member of said panel comprises, and preferably expresses, at least one of said 
10 multiple alleles, wherein variation in transpiration efficiency between the 

members of said panel indicates that said gene is involved in determining 
transpiration efficiency. 

Preferably, the identified gene or allele identified by the method described in the 
15 preceding paragraph is an ERECTA allele, or an erecta allele, from a plant selected 
from the group consisting of A. thaliana, sorghum, rice, maize and wheat, or a homolog 
thereof. 

The identified gene or allele, including any homologs from a plant other than A. 
20 thaliana, such as, for example, the wild-type ERECTA allele or a homolog thereof, is 
useful for the production of novel plants. Such plants are produced, for example, using 
recombinant techniques, or traditional plant breeding approaches such as introgression. 

Accordingly, a still further aspect of the present invention provides a method of 
25 modulating (i.e., enhancing or reducing) the transpiration efficiency of a plant 
comprising ectopically expressing in a plant an isolated ERECTA gene or an alleic 
variant thereof or the protein-encoding region of said ERECTA gene or said allelic 
variant. In a particularly preferred embodiment, the invention provides a method of 
enhancing the transpiration efficiency of a plant comprising introgressing into said 
30 plant a nucleic acid comprising a nucleotide sequence that is homologous to a protein- 
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encoding region of a gene of A. thaliana that maps to the ERECTA locus on 
chromosome 2. 

A further embodiment of the invention provides a method of modulating the 
5 transpiration efficiency of a plant comprising introducing (eg., by classical breeding, 
introgression or recombinant means), and preferably expressing therein, an isolated 
ERECTA gene or an allelic variant thereof or the protein-encoding region thereof to a 
plant and selecting a plant having a different transpiration efficiency compared to a 
near-isogenic plant that does not comprise the introduced ERECTA gene or allelic 
10 variant or protein-encoding region. Preferably, the ERECTA gene or allelic variant or 
protein-encoding region comprises a nucleotide sequence selected from the group 
consisting of: 

(a) a sequence having at least about 55% identity to a sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 

15 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 

NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 
SEQ ID NO: 19 SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 

20 NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 

SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID 
NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 

(b) a sequence encoding an amino acid sequence having at least about 55% identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 

25 SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 

12, SEQ ID NO: 20 and SEQ ID NO: 45. 

The plant into which the gene etc is introduced is preferably selected from the group 
consisting of Arabidopsis thaliana, rice, sorghum, wheat and maize. As will be 
30 apparent from the present disclosure, the transpiration efficiency is enhanced as a 
consequence of the ectopic expression of an ERECTA allele or the protein-encoding 
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region thereof in the plant. In contrast, the transpiration efficiency is reduced as a 
consequence of reduced expression of an ERECTA allele in the plant (eg., by 
expression of antisense KNA or KNAi or other inhibitory RNA). 

5 A further aspect of the invention provides for the use of an isolated ERECTA gene or an 
allelic variant thereof or the protein-encoding region of said ERECTA gene or said 
allelic variant in the preparation of a gene construct for modulating (ie., enhancing or 
reducing) the transpiration efficiency of a plant. For example, expression of ERECTA 
protein in the plant can be modified by ectopic expression of an ERECTA allele in the 

10 plant, or alternatively, by reducing endogenous ERECTA expression using an 
inhibitory RNA (eg, antisense or RNAi). 

A fifth aspect of the present invention provides a plant having enhanced transpiration 
efficiency, wherein said plant is produced by a method described herein. 

15 

Plants that have enhanced transpiration efficiency show increased levels of growth 
under normal growth conditions, thereby increasing their biomass. Accordingly, a 
further aspect of the present invention provides a method of increasing the biomass of a 
plant comprising enhancing the level of expression of an ERECTA gene or allelic 
20 variant thereof or protein coding region thereof in said plant. 

In one embodiment, the method further includes the step of selecting a plant that has an 
increased biomass when compared to an unmodified plant. Methods of determining the 
biomass of a plant are well known to those skilled in the art and/or described herein. 

25 

In one embodiment, the level of expression is enhanced by genetic modification of a 
control sequence, for example a promoter sequence, associated with the ERECTA gene 
or allelic variant thereof. 

30 In another embodiment, the level of expression is enhanced by introducing (eg., by 
classical breeding, introgression or recombinant means) an ERECTA gene or allelic 
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variant thereof or the protein encoding region thereof to a plant. Preferably, the 
ERECTA gene or allelic variant or protein-encoding region comprises a nucleotide 
sequence selected from the group consisting of: 

a sequence having at least about 55% identity to a sequence selected from the group 
5 consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19 SEQ ID 
NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID 
NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID 
10 NO: 31, SEQ ID NO: 32 SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID 
NO: 36, SEQ ID NO: 37, SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID 
NO: 41, SEQ ID NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 
a sequence encoding an amino acid sequence having at least about 55% identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 
15 NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 
20 and SEQ ID NO: 45. 

The plant into which the gene etc is introduced is preferably selected from the group 
consisting of Arabidopsis thaliana, rice, sorghum, wheat and maize. 

20 

A further aspect of the present invention provides a method of increasing the resistance 
of a plant to an environmental stress comprising enhancing the level of expression of an 
ERECTA gene or allelic variant thereof or protein coding region thereof in said plant. 

25 As used herein the term "environmental stress" shall be taken in its broadest context to 
mean one or more environmental conditions that reduce the ability of a plant to grow, 
survive and/or produce seed/grain. In one embodiment, an environmental stress that 
affects the ability for a plant to grow, survive and/or produce seed/grain is a condition 
selected from the group consisting of increased or decreased CO2 levels, increased or 

30 decreased temperature, increased or decreased rainfall, increased or decreased 
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humidity, increased salt levels in the soil, increased soil strength and compaction and 
drought. 

In one embodiment, the method further includes the step of selecting a plant that has an 
5 altered resistance to an environmental stress when compared to an unmodified plant is 
selected. Methods of determining the resistance of a plant to environmental stress are 
well known to those skilled in the art and/or described herein. 

In one embodiment, the level of expression is enhanced by genetic modification of a 
10 control sequence, for example a promoter sequence, associated with the ERECTA gene 
or allelic variant thereof. 

In another embodiment, the level of expression is enhanced by introducing (eg., by 
classical breeding, introgression or recombinant means) an ERECTA gene or allelic 
15 variant thereof or the protein encoding region thereof to a plant. Preferably, the 
ERECTA gene or allelic variant or protein-encoding region comprises a nucleotide 
sequence selected from the group consisting of: 

a sequence having at least about 55% identity to a sequence selected from the group 
consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 

20 NO: 9, SEQ ID NO. 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ED NO: 18, SEQ ID NO: 19 SEQ ID 
NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID 
NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID 
NO: 31, SEQ ID NO: 32 SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID 

25 NO: 36, SEQ ID NO: 37, SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID 
NO: 41, SEQ ID NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 
a sequence encoding an amino acid sequence having at least about 55% identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 

30 20 and SEQ ID NO: 45. 



WO 2004/005555 




•CT/AU2003/000854 



20 



The plant into which the gene etc is introduced is preferably selected from the group 
consisting of Arabidopsis thaliana, rice, sorghum, wheat and maize. 

A further aspect of the present invention provides a plant having increased resistance to 
5 environmental stress, wherein said plant is produced by a method described herein. 

Both temperature and available moisture have been shown to dramatically influence 
pollination and grain/seed development, processes known as seed-set and grain-filling. 
Accordingly, a method that produces a plant that is resistant to environmental stress, ie 
10 a plant that has increased transpiration efficiency, results in increased or more efficient 
grain-filling and greater seed number. As ERECTA is expressed during flowering or 
pod development this gene or an allelic variant thereof is useful for increasing grain- 
filling in a plant. 

15 Accordingly, a further aspect of the present invention provides a method of increasing 
seed or grain weight in a plant comprising enhancing the level of expression of an 
ERECTA gene or allelic variant thereof or protein coding region thereof in said plant. 

In one embodiment, the method further includes the step of selecting a plant that has 
20 increased seed or grain weight when compared to an unmodified plant is selected. 
Methods of determining seed or grain weight are well known to those skilled in the art 
and/or described herein. 

In one embodiment, the level of expression is enhanced by genetic modification of a 
25 control sequence, for example a promoter sequence, associated with the ERECTA gene 
or allelic variant thereof. 

In another embodiment, the level of expression is enhanced by introducing (eg., by 
classical breeding, introgression or recombinant means) an ERECTA gene or allelic 
30 variant thereof or the protein encoding region thereof to a plant. Preferably, the 
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ERECTA gene or allelic variant or protein-encoding region comprises a nucleotide 
sequence selected from the group consisting of: 

a sequence having at least about 55% identity to a sequence selected from the group 
consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
5 NO: 9, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19 SEQ ID 
NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID 
NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID 
NO: 31, SEQ ID NO: 32 SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID 

10 NO: 36, SEQ ID NO: 37, SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID 
NO: 41, SEQ ID NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 
a sequence encoding an amino acid sequence having at least about 55% identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 

15 20 and SEQ ID NO: 45. 

The plant into which the gene etc is introduced is preferably selected from the group 
consisting of Arabidopsis thaliana, rice, sorghum, wheat and maize. 

20 A further aspect of the present invention provides a plant having increased seed or 
grain weight, wherein said plant is produced by a method described herein. 

A still further aspect of the present invention provides a method of modulating the 
number of seeds produced by a plant comprising enhancing the level of expression of 
25 an ERECTA gene or allelic variant thereof in said plant. 

In one embodiment, the method further includes the step of selecting a plant that has an 
increased number of seeds when compared to an unmodified plant is selected. Methods 
of determining seed or grain number are well known to those skilled in the art and/or 
30 described herein. 



WO 2004/005555 



KTT/AU2003/000854 



22 

In one embodiment, the level of expression is enhanced by genetic modification of a 
control sequence, for example a promoter sequence, associated with the ERECTA gene 
or allelic variant thereof. 

5 In another embodiment, the level of expression is enhanced by introducing (eg., by 
classical breeding, introgression or recombinant means) an ERECTA gene or allelic 
variant thereof or the protein encoding region thereof to a plant. Preferably, the 
ERECTA gene or allelic variant or protein-encoding region comprises a nucleotide 
sequence selected from the group consisting of: 

10 a sequence having at least about 55% identity to a sequence selected from the group 
consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19 SEQ ID 
NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID 

15 NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID 
NO: 31, SEQ ID NO: 32 SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID 
NO: 36, SEQ ID NO: 37, SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID 
NO: 41, SEQ ID NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 
a sequence encoding an amino acid sequence having at least about 55% identity to an 

20 amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 
20 and SEQ ID NO: 45. 

The plant into which the gene etc is introduced is preferably selected from the group 
25 consisting of Arabidopsis thaliana, rice, sorghum, wheat and maize. 

A further aspect of the present invention provides a plant having an increased number 
of seeds, wherein said plant is produced by a method described herein. 



30 BRIEF DESCRIPTION OF THE DRAWINGS 
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Figure la is a graphical representation showing the CO2 assimilation rates (fimol C m 2 
s" 1 ) of several genotypes of A. thaliana. Measurements were completed on rosette 
leaves during bolting and flowering stages. Plants were grown on fertilised soil. The 
genotypes of plants are indicated on the x-axis, and CO2 assimilation rates indicated on 
5 the ordinate. Col indicates a genetic background of the ecotype Columbia. Ld indicates 
a genetic background of the ecotype Landsberg. Plants expressing wild type ERECTA 
alleles were either in a Col (Co\4-ER) or Ld (Ld-ER) background. Plants that were 
homozygous for a mutant er allele were either in a Ld background (Ld-erl) or in a Col 
background (Co\-erl05 or CoVer2 (line 3401 atNASC, also named Col-erl06 by Torii 
10 and collaborators ( see Lease et al. 2001, New Phytologist, 151:133-143)). Plants 
designated as Fl (Col-Eft x Ld-er) were heterozygous ER/erl. Data indicate that, in a 
Col background, the erl05 mutation leads to reduced CO2 assimilation rate, whilst the 
erl mutation enhances CO2 assimilation rate in a Ld background. 

15 Figure lb is a graphical representation showing the stomatal conductance (mol H2O m 2 
s" 1 ) of several genotypes of A. thaliana (same plants as Fig. la). The genotypes of 
plants are indicated on the x-axis and are the same as described in the legend to Figure 
la. Stomatal conductances are indicated on the ordinate. Data indicate that, in a Col 
background, the er2/erl06 mutation significantly enhances stomatal conductance, 

20 whilst the erl mutation significantly enhances stomatal conductance in a Ld 
background. 

Figure lc is a graphical representation showing the transpiration efficiency of (mmol C 
mol H2O' 1 ) of several genotypes of A. thaliana, as determined by the ratio of CO2 

25 assimilation rate to stomatal conductance. The genotypes of plants are indicated on the 
x-axis and are the same as described in the legend to Figure la. Transpiration efficiency 
is indicated on the ordinate. Data indicate that transpiration efficiency is enhanced in 
plants expressing a wild type ER allele relative to a mutant er allele, in both Ld and Col 
backgrounds. The lowest transpiration efficiency was observed for plants that are 

30 homozygous for the erl 05 allele (ie. Col-erlOS), consistent with the fact that this allele 
inhibits ERECTA expression. From the data in Figures la-lc, it is apparent that the 
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lower transpiration efficiency of plants expressing the erl05 allele is largely due to a 
reduced CO2 fixation rate, whereas for both the er2/erl06 and erl alleles, reduced 
transpiration efficiency is largely due to enhanced stomatal conductance. The 
transpiration efficiency of the Fl heterozygote plant was intermediate between the 
5 transpiration efficiencies of its parents, suggesting codominance of these alleles. The 
Fl, however, had a transpiration efficiency closer to that of the pollen donor parent, Ld- 
erl. 

Figure 2a is a graphical representation showing the stomatal densities (Number of 

10 stomata mm* 2 leaf) for several genotypes of A. thaliana in three independent 
experiments. The genetic backgrounds of plants are indicated on the x-axis (Col, 
Columbia; Ld, Landsberg), and stomatal densities are indicated on the ordinate. Plant 
genotypes are indicated at the top of each bar, as follows: plants expressing wild type 
ERECTA alleles in a Col background were CoUER or ColLEK (hatched bars); plants 

15 expressing wild type ERECTA alleles in a Ld background were ER (open bars); plants 
expressing mutant erecta alleles in a Col background were either erl 05 or er2/106 (Col 
filled boxes); and plants expressing the mutant erl allele in a Ld background were erl 
(Ld filled boxes). Columns designated a,b are data from two experiments where plants 
were grown in soil in the absence of fertiliser. The set of columns at the right of the 

20 figure are from a third experiment where the same plants were grown in soil 
comprising fertiliser. Data indicate that, in a Col background, the erl05 mutation and 
er2/106 mutation enhances stomatal density, which in part accounts for the enhanced 
stomatal conductances and reduced transpiration efficiencies of plants expressing these 
alleles (Figures lb and lc). The general effect of these alleles is not dependent on the 

25 nutrient status of the soil. In contrast, the erl allele only enhanced stomatal density of 
Ld plants when fertiliser was absent, suggesting that in this ecotype enhanced stomatal 
, aperture accounted for the enhanced stomatal conductances and reduced transpiration 
efficiencies measured in the erl mutant under ample nutrient supply (Figures lb, lc). 
The erl mutation therefore affects both stomatal aperture and stomatal density but the 

30 relative contributions of these effects to enhanced stomatal conductance per unit leaf 
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area depend on environmemtal factors and plant nutrient status, and on genetic 
background. 

Figure 2b is a graphical representation showing the epidermal cell size (surface area, 
5 |im 2 ) for several genotypes of A. thaliana in three independent experiments. The 
genetic backgrounds and genotypes of plants are indicated on the x-axis and at the tops 
of each column, respectively, as in the legend to Figure 2a. The ordinate indicates 
epidermal cell size. Columns designated a,b are data from two experiments where 
plants were grown in soil in the absence of fertiliser. The set of columns at the right of 

10 the figure are from a third experiment where the same plants were grown in soil 
comprising fertiliser. Data indicate that, in a Col background, the erlOS mutation and 
er2/erl06 mutation significantly reduce epidermal cell size ie increase the number of 
epidermal cells per unit leaf area. This reveals that the ER gene has effects on leaf 
histogenesis which, beyond their consequences on stomatal densities, may also directly 

15 affect leaf capacity for photosynthesis and therefore transpiration efficiency, (Figures 
lb and lc). The general effects of these alleles are not dependent on the nutrient status 
of the soil. In contrast, in a Ld background, the erl allele reduced epidermal cell size 
only when fertiliser was absent. 

20 Figure 2c is a graphical representation showing the stomatal index for several 
genotypes of A. thaliana in three independent experiments. The genetic backgrounds 
and genotypes of plants are indicated on the x-axis and at the tops of each column, 
respectively, as in the legend to Figure 2a. The ordinate indicates stomatal index, as 
determined from the ratio of stomatal density to epidermal cell density. Columns 

25 designated a,b are data from two experiments where plants were grown in soil in the 
absence of fertiliser. The set of columns at the right of the figure are from a third 
, experiment where the same plants were grown in soil comprising fertiliser. Data 
indicate that the er mutations tested do not significantly modify stomatal index in Col 
background (because increases in stomatal density are correlated to increases in 

30. epidermal cell numbers in the Col mutant plants) but does so in Landsberg background. 
Accordingly, the ER gene does appear to directly modify stomatal development per se. 
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Taken together Figures 2a-c therefore show that the ERECTA gene has two types of 
effects on leaf stomatal conductance: a) developmental, b) biophysical and/or 
biochemical. The expression of these effects and impact on transpiration rate vary with 
genetic background, suggesting interactions with other genes that are polymorphic 
5 between the Col and Ld ecotypes, and also with nutrient status. 

Figure 3 is a graphical representation showing carbon isotope composition (y-axis; in 
per mil, for vegetative rosettes) for 7 different experimental runs (numbers 1-7) carried 
out under growth cabinet conditions and glasshouse conditions. For each run, the left- 
10 hand side bar shows the mean value of carbon isotope composition for lines carrying 
the ERECTA allele, while the right-hand side bar shows the mean value across lines 
with the erecta allele. In all cases, 8 13 C isotopic composition values for the er-lines are 
more negative then those for ER lines, indicative of lower transpiration efficiencies. 

15 Figure 4a is a graphical representation showing ERECTA gene copy number and 
expression levels in transgenic T2 A. thaliana plants homozygous for an ER transgene. 
These lines were generated by transforming the Col-er2/106 mutant with the wild type 
ER gene under the 35S promoter. Effective transformation was ascertained and 
ERECTA expression levels were quantified in several independent transformants using 

20 real-time quantitative PCR (ABI PRISM 7700, Sequence Detection System User 
Bulletin #2. 1997). Copy number (y-axis) is indicated as a function of the plant line, 
following normalisation of ERECTA relative to the copy number of a control gene (18S 
ribosomal RNA gene). The expression of the 18S rRNA gene was shown 
independently not to be affected by changes in ER expression. Line 143 is null control 

25 (no insert). Lines 145, 165, 169 and 279 are transformed lines carrying the ERECTA 
allele. All ER transgenic lines, except line 145, show increased mRNA copy number: 
from 4 to 9.5 fold increase compared with the null control. 

Figure 4b is a graphical representation showing ERECTA gene copy number and 
30 expression levels in transgenic T2 A. thaliana plants homozygous for an ER transgene, 
and generated by transformation of the Col-erl05 mutant. Effective transformation 
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was ascertained and ERECTA expression levels were quantified in several independent 
transformants using real-time quantitative PCR (ABI PRISM 7700, Sequence Detection 
System User Bulletin #2. 1997). Copy number (y-axis) is indicated as a function of the 
plant line, following normalisation of ERECTA relative to the copy number of a control 
5 gene (18S ribosomal RNA gene). The expression of the 18S rRNA gene was shown 
independently not to be affected by changes in ER expression. Line 18 is a null control 
line (no ER insert, ie similar to Col-erl05). Lines 8, 19, 29 and 61 are transgenic lines 
carrying the ERECTA allele. All ER transgenic lines show increased mRNA copy 
number: from 10 to 170 fold increase compared with the null control. 

10 

Figure 4c is a graphical representation showing ERECTA gene copy number and 
expression levels in Col and Ld ER ecotypes and in one Ld-ER transgenic line (3-7K) 
generated by transformation of the Ld-erl ecotype (NW20) with the ER wild type gene 
under control of the 35S promoter. Effective transformation was ascertained and 

15 ERECTA expression levels were quantified in several independent transformants using 
real-time quantitative PCR (ABI PRISM 7700, Sequence Detection System User 
Bulletin #2. 1997). Copy number (y-axis) is indicated as a function of the plant line, 
following normalisation of ERECTA relative to the copy number of a control gene (18S 
ribosomal RNA gene). The expression of the 18S rRNA gene was shown 

20 independently not to be affected by changes in ER expression. Lines 933, 1093 and 
3176 are the non-transformed Columbia-JSRECTM ecoptypes Col-4, Col-0 and Col-1. 
Line 105 c is a Col-erl05 line (knockout for ER), used for generating transgenic lines 
shown in Figure 4b. Lines labelled 2c and 3401 on the X-axis describe Col- er2/106 (2 
batches of seeds, used for generating transgenic lines shown in Figure 4a). Line NW20 

25 is Ld-erl. Line 3-7K is a Ld-ER transformant, obtained from transformation of Ld-erl 
with the ERECTA allele. Line 3 177 is the Ld-ER ecotype, near-isogenic to NW20. 

Figure 5a is a graphical representation of a first experiment showing copy number of 
the mRNA transcription product of the rice ERECTA gene in various plant organs/parts, 
30 cv Nipponbare. L= mature leaf blades; YL= young expanding leaves, still enclosed in 
sheaths of older leaves; R= mature root; YR= young root; SEN sheaths; INF= unfolded 
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young panicle still enclosed in sheaths; 07: young panicles. Rice ERECTA mRNA 
copy numbers were determined by quantitative real-time PCR, with 18S mRNA as 
internal control gene for normalization of results. The values on the y-axis describe 
fold increases of rice ERECTA mRNA in various parts compared to the L sample 
5 (mature leaves) set to a value of 1 for normalization. Data show a similar expression 
pattern as the ERECTA gene in Arabidopsis (see Torii et al. 1996) ie preferential 
expression in young meristematic tissues, especially in reproductive organs. 

Figure 5b is a graphical representation of a second experiment showing copy number of 
10 the mRNA transcription product of the rice ERECTA gene in various plant organs/parts. 
L= mature leaf blades; YL= young expanding leaves, still enclosed in sheaths of older 
leaves; R= mature root; YR= young root; SH= sheaths; INF= unfolded young panicle 
still enclosed in sheaths; 07: young panicles. Rice ERECTA mRNA copy numbers 
were determined by quantitative real-time PCR, with 18S mRNA as internal control 
15 gene for normalization of results. The values on the y-axis describe fold increases of 
rice ERECTA mRNA in various parts compared to the L sample (mature leaves) set to a 
value of 1 for normalization. Data confirm those shown in Figure 5a. 

Figure 6 is a graphical representation showing leaf transpiration efficiency (mmol C 
20 mol H2O' 1 , Figure 6a), calculated from the direct measurements of leaf C0 2 
assimilation rate (nmol C in 2 s" 1 , Figure 6b) and stomatal conductance (mol H 2 0 m* 2 s" 
l 9 Figure 6c) by gas exchange techniques, under 350 ppm CO2 (ie same as ambient 
[CO2] during seedling growth; left hand bar in each pair of bars) and 500ppm CO2 
(right hand bar in each pair of bars), for Ld-erl, and two Ld_ER lines: line T2(+ER), a 
25 T2 transgenic line homozygous for an ER transgene in the Ld-erl background and line 
3177, an ER ecotype near-isogenic to Ld-erl (NASC Stock Centre information). 
Genotypes are shown at the bottom of the figure. Leaf temperature during 
measurements was controlled at 22°C, leaf to air vapour pressure deficit at around 8mb. 

30 Figure 7 is a graphical representation showing leaf transpiration efficiency (mmol C 
mol H2O -1 , Figure 7a), calculated from the direct measurements of leaf CO2 
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assimilation rate (pmol C m" 2 s" 1 , Figure 7b) and stomatal conductance (mol H 2 0 m* 2 s* 
l 9 Figure 7c) by gas exchange techniques, under 350 ppm CO2 (ie same as ambient 
[CO2] during seedling growth; left hand bar in each pair of bars) and 500ppm C0 2 
(right hand bar in each pair of bars), for 4 genotypes: Col4 (ER) (left hand pair), Ld 
5 (erl) (right hand pair) and their Fx progeny (middle two pairs). Genotypes are shown at 
the bottom of the figure. 

Figure 8 is a graphical representation showing stomatal conductance and epidermal 
anatomy at 350ppm CO2 in the genotypes described in Figures 6 and 7 and shown at 

10 the bottom of the figure. The insertion of ER transgene (line T2+ER) caused a 
decreased in stomatal conductance compared to the Ld-erl line (Figure 8a), which was 
in part due to a decrease in stomatal density (see Figure 8c). These two effects again 
indicate complementation. Together Figure 8b and 8c show that the decrease in 
stomatal density is relatively more important than that in epidermal cell density, 

15 indicating an effect of the transgene on epidermis development. 

Figure 9 is a graphical representation showing a comparison of stomatal density and 
epidermal cell area in a range of Col er lines carrying mutations in the ER gene (bars 1 
to 8 Fig. 9a; bar 1 to 7 in Fig. 9b, mutants erl05, erl06, 108, 111, 114, 116, 117, as 

20 described in Lease et al. 2001; a gift from Dr Keiko Torii) and in Col-ER wild type 
ecotypes (bars 9-11 or 8-10 in Figures 9a and 9b, respectively: ColO, background 
ecotype for these mutants; Coll, Col4 (ColER parental line for QTL analysis of Lister 
and Dean's RILs), two Ld_erl lines (NW20 and CS20, bars 12&13 and 11&12 in Figs 
9a and 9b respectively, two very similar lines according to NASC; NW20 is the other 

25 parental line for Lister and Dean's RILs) and finally line T2+ER, a transgenic Ld-ER 
line carrying the ER wild type gene in Ld-erl background (extreme right hand bar on 
the figure). 

Figure 10 is a graphical representation showing carbon isotopic composition (per mil, 
30 y-axis) in a range of lines (numbered 1 to 19 on the x-axis): Col-er mutants (line 1-14); 
the ColO background ecotype (line 15); Ld-erl lines (lines 16 and 17); an Ld-ER near 
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isogenic ecotype to Ld-erl (line 18, line 3177 at NASC), and a transgenic T2 Ld-ER 
line (line numbered 19) obtained by transformation of Ld-erl mutant with a construct 
carrying the wild type ER allele. The data show that the ER allele gives less negative 
values indicative of increased transpiration efficiency. 

5 

Figure 1 1 is a graphical representation showing direct measurements of transpiration 
efficiency in Col-er mutants transformed with ER transgene, under both high and low 
air humidity, such as occurs during hot temperature events causing or associated with 
drought. Transpiration efficiency was measured by gas exchange techniques on mature 

10 leaves of vegetative Arabidopsis rosettes, as a function of leaf-to-air vapour pressure 
difference (vpd) ie air humidity around the leaves. The higher the vpd, the drier the air. 
Solid circles describe measurements for 5 independent transgenic T2 lines homozygous 
for an ER transgene; these lines were generated by transforming the Col-erl05 mutant 
(empty squares) with a construct carrying the ER allele under control of the 35S 

15 promoter. Data for null lines (ie lines that went through transgenesis but do not carry 
the ER transgene) are represented by solid squares. This figure demonstrates 
complementation, across the whole range of humidity tested, with the transpiration 
efficiencies in T2 ER lines being greater than those in the complemented Col-er 105 
mutant, and similar to those measured in the ColO-ER ecotype (empty triangles; 

20 background ecotype for Col-er 105). 

Figure 12 is a graphical representation of an alignment of isolated sequences with the 
entire coding region of the wheat ortholog of ERECT A. The position of each of the 
isolated sequences is shown relative to the wheat ortholog of ERECT A. Sequences are 
25 represented by either SEQ ID NO. or gene accession number. 

Figure 13 is a graphical representation of an alignment of isolated sequences with the 
entire coding region of the maize ortholog of ERECT A. The position of each of the 
isolated sequences is shown relative to the maize ortholog of ERECT A. Sequences are 
30 represented by either SEQ ID NO. or gene accession number. 
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Figure 14 is a graphical representation of a pairwise sequence alignment of the 
ERECTA proteins isolated from Arabidopsis (SEQ ID NO: 2), maize (SEQ ID NO: 
45), rice (SEQ ID NO: 3), Sorghum (SEQ ID NO: 5) and wheat (SEQ ID NO: 20). The 
alignment was performed using CLUSTALW multiple sequence alignment tool. 
5 Residues that are conserved between all species are indicated by asterisks (*). 
Conservation of the groups STA NEQK NHQK NDEQ QHRK M1LV MILF HY or 
FYW is indicated by ":". Conservation of the groups CSA ATV SAG STNK STPA 
SGND SNDEQK NDEQHK NEQHRK FVLIM HFY is indicated by " Gaps are 
indicated by dashes 

10 

Figure 15 is a graphical representation of a phylogenetic tree indicating the relationship 
between each of the ERECTA proteins isolated from Arabidopsis (SEQ ID NO: 2), 
maize (SEQ ID NO: 45), rice (SEQ ID NO: 3), Sorghum (SEQ ID NO: 5) and wheat 
(SEQ ID NO: 20). 

15 

DETAILED DESCRIPTION OF THE INVENTION 
Loci for transpiration efficiency and their identification 

One aspect of the invention provides a locus associated with the genetic variation in 
20 transpiration efficiency of a plant, wherein said locus comprises a nucleotide sequence 
linked genetically to an ERECTA locus in the genome of the plant. 

As used herein, the term "locus" shall be taken to mean the location of one or more 
genes in the genome of a plant that affects a quantitative characteristic of the plant, in 

25 particular the transpiration efficiency of a plant. In the present context, a "quantitative 
characteristic" is a phenotype of the plant for which the phenotypic variation among 
different genotypes is continuous and cannot be separated into discrete classes, 
irrespective of the number of genes that determine or control the phenotype, or the 
magnitude of genetic effects that single gene has in determining the phenotype, or the 

30 magnitude of genetic effects of interacting genes. 
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By "associated with the genetic variation in transpiration efficiency of a plant" means 
that a locus comprises one or more genes that are expressed to determine or regulate the 
transpiration efficiency of a plant, irrespective of the actual rate of transpiration 
achieved by the plant under a specified environmental condition. 

5 

Preferably, the locus of the invention is linked to or comprises an ERECTA allele or 
erecta allele, or a protein-encoding portion thereof 

As used herein, the term "ERECTA" shall be taken to refer to a wild type allele 
10 comprising the following domains GTIGYIDPEYARTS, GAAQGLAYLHHDC, and 
TENLSEKXIIGYGAS STVYKC domains, wherein X means Y or H, or domains more 
than 94 % identical to these domains. To the inventors' knowledge no other protein 
comprises these domains. Preferred ERECTA alleles comprise a nucleotide sequence 
having at least about 55% overall sequence identity to the protein-encoding region of 
15 any one of the exemplified ERECTA alleles described herein, particularly any one of 
SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13, or 15. Preferably, the percentage identity to any one 
of said SEQ ID NOs: is at least about 59-61%, or 70% or 80%, and more preferably at 
least about 90%, and still more preferably at least about 95% or 99%. 

20 Preferred ERECTA alleles are derived from, or present in, the genome of a plant that is 
desiccation or drought intolerant, or poorly adapted for growth in dry or arid 
environments, or that suffers from reduced vigor or growth during periods of reduced 
rainfall or drought, or from the genome of a plant with increased growth rate or growth 
duration or partitioning of C to shoot and harvested parts under well-watered 

25 conditions. 

More preferably, an ERECTA allele is derived from, or present in, the genome of a 
brassica plant, broad acre crop plant, perennial grass (eg. of the subfamily Pooidaea, or 
the Tribe Poeae), or tree. Even more preferably, an ERECTA allele is present in or 
30 derived from the genome of a plant selected from the group consisting of barley, wheat, 
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rye, sorghum, rice, maize, Phalaris aquatica, Dactylus glomerata, Lolium perenne, 
Festuca arundinacea y cotton, tomato, soybean, oilseed rape, poplar, and pine. 

The term "erectd" shall be taken to mean any allelic variant of the wild-type ERECTA 
5 allele that modifies transpiration efficiency of a plant 

Preferred erecta alleles include the following A. thaliana erecta alleles derived from 
Columbia (Col) and Landsberg erecta (er) lines. 

10 



Erecta alleles 1 


Genomic position 


Lesion 


Affected domain 


Ler er-1 


2249 


T-*A 


PK 


Col er-101 


6565 


T->A 


PK 


Col er-102/106 


6565 


T-+A 


PK 


Col er-103 


846 


G-»A 


LRR10 


Col er-105 


foreign DNA insert 
between +5 and 
+1056 


insertion 


NuU allele 


Col er-108 


5649 


G^A 




Col er-111 


5749 


G->A 


Untranslated region 
between LRR and 
transmembrane 
domains 


Col er-113 


3274 


C-»T 




Col er-114 


6807 


G-> A 


PK 


Col er-115 


3796 


C— T 




Col er-116 


6974 


G— > A 


PK 


Col er-1 17 


5203 


G— A 


LRR18 



1 alleles described by Lease et al. 2001, New Phytologist, 151: 133-143, except for Ler 
er-1, Col er-103 and Col-erl05 which were described in Torii et al., 1996 , The Plant 
Cell 8:735-746 

15 

The present invention clearly encompasses an erecta allele derived from, or present in, 
the genome of a plant that is desiccation or drought intolerant, or poorly adapted for 
growth in dry or arid environments, or that suffers from reduced vigor or growth during 
20 periods of reduced r ainf all or drought, or from the genome of a plant with increased 
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growth rate or growth duration or partitioning of C to shoot and harvested parts under 
well-watered conditions. 

More preferably, an erecta allele is derived from, or present in, the genome of a 
5 brassica plant, broad acre crop plant, perennial grass (eg. of the subfamily Pooidaea, or 
the Tribe Poeae), or tree. Even more preferably, an erecta allele is present in or derived 
from the genome of a plant selected from the group consisting of barley, wheat, rye, 
sorghum, rice, maize, Phalaris aqziatica, Dactylus glomerata, Lolitan peremie, Festuca 
arundinacea, cotton, tomato, soybean, oilseed rape, poplar, and pine. 

10 

For the purposes of nomenclature, the nucleotide sequence of the Arabidopsis thaliana 
ERECTA protein-encoding region and the 5' -untranslated region (UTR) and 3'-UTR, is 
provided herein as SEQ ID NO: 1. The amino acid sequence of the polypeptide 
encoded by SEQ ID NO: 1 is set forth herein as SEQ ID NO: 2. 

15 

A particularly preferred ERECTA allele from rice (Oryza sativa) is derived from 
chromosome 6 of that plant species. For the purposes of nomenclature, the protein- 
encoding region of the rice ERECTA gene is provided herein as SEQ ID NO: 3. The 
amino acid sequence of the polypeptide encoded by SEQ ID NO: 3 is set forth herein as 
20 SEQ ID NO: 4. 

A particularly preferred ERECTA gene derived from the genome of Sorghum bicolor, is 
provided herein as SEQ ID NO: 5. The amino acid sequence of the polypeptide 
encoded by SEQ ID NO: 5 is set forth herein as SEQ ID NO: 6. 

25 

A further exemplary ERECTA gene derived from A. thaliana is provided herein as SEQ 
ID NO: 7. The amino acid sequence of the polypeptide encoded by SEQ ID NO: 7 is 
set forth herein as SEQ ID NO: 8. 
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A further exemplary ERECTA gene derived from A. thaliana is provided herein as SEQ 
ID NO: 9. The amino acid sequence of the polypeptide encoded by SEQ ID NO: 9 is 
set forth herein as SEQ ID NO: 10. 

5 Fragments of an exemplary ERECTA gene derived from the genome of wheat are 
provided herein as SEQ ID NOs: 1 1 to 18. 

An exemplary ERECTA gene derived from the genome of wheat is provided herein as 
SEQ ID NO: 19. The amino acid sequence of the polypeptide encoded by SEQ ID NO: 
10 19 is set forth herein as SEQ ID NO: 20. 

Fragments of an exemplary ERECTA gene derived from the genome of maize are 
provided herein as SEQ ID NOs: 21 to 43. 

15 An exemplary ERECTA gene derived from the genome of maize is provided herein as 
SEQ ID NO: 44. The amino acid sequence of the polypeptide encoded by SEQ ED NO: 
44 is set forth herein as SEQ ID NO: 44. 

The present invention clearly contemplates the presence of multiple genes that are 
20 genetically linked or map to the specified ERECTA locus on chromosome 2. Without 
being bound by any theory or mode of action, such multiple linked genes may interact, 
such as, for example, by epistatic interaction, to determine the transpiration efficiency 
phenotype. 

25 The present invention also contemplates the presence of different alleles of any gene 
that is linked to the ERECTA locus, wherein said allele is expressed to determine the 
transpiration efficiency phenotype. In one embodiment, such alleles are identified by 
detecting a particular transpiration efficiency phenotype that is linked to the expression 
of the particular allele. Alternatively, or in addition, the different alleles linked to a 

30 locus are identified by detecting a structural polymorphism in DNA (eg. a restriction 
fragment length polymorphism (RFLP), amplified fragment length polymorphism 
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(AFLP), single strand chain polymorphism (SSCP), and the like), that is linked to a 
particular transpiration efficiency phenotype. 

The present invention clearly encompasses all interacting genes and/or alleles that are 
5 genetically linked to an ERECTA locus and are expressed to determine a transpiration 
efficiency phenotype. Such linked interacting genes and/or alleles will map to an 
ERECTA locus and be associated with the transpiration efficiency of that plant. 
Preferably, such interacting genes and/or alleles comprise a protein-encoding portion of 
a gene positioned within the ERECTA locus of the genome that is associated with the 
10 transpiration efficiency of that plant. 

Homologs and/or orthologs of the exemplified alleles are clearly encompassed by the 
invention. Those skilled in the art are aware that the terms "homolog" and "ortholog" 
refer to functional equivalent units. In the present context, a homolog or ortholog of a 
15 gene that maps to an ERECTA locus shall be taken to mean any gene from a plant 
species that is functionally equivalent to a gene that maps to an exemplified ERECTA 
locus, and comprises a protein-encoding region in its native plant genome that shares a 
degree of structural identity or similarity with a protein-encoding region of the 
exemplified ERECTA gene. 

20 

Preferably, a homologous or orthologous gene from a plant other than A. thaliana will 
be associated with the transpiration efficiency of said plant and be linked to a protein- 
encoding region in its native plant genome that comprises a nucleotide sequence having 
at least about 55% overall sequence identity to a protein-encoding region linked to the 
25 ERECTA locus. Even more preferably, the percentage identity will be at least about 
59-61% or 70% or 80%, still more preferably at least about 90%, and even still more 
preferably at least about 95%. 

In determining whether or not two nucleotide sequences fall within a particular 
30 percentage identity limitation recited herein, those skilled in the art will be aware that it 
is necessary to conduct a side-by-side comparison or multiple alignment of sequences. 
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In such comparisons or ahgnments, differences may arise in the positioning of non- 
identical residues, depending upon the algorithm used to perform the alignment m the 
present context, reference to a percentage identity between two or more nucleotide 
sequences shall be taken to refer to the number of identical residues between said 
5 sequences as determined using any standard algorithm known to those skilled in the art 
For example, nucleotide sequences may be aligned and their identity calculated using 
the BESTFTT program or other appropriate program of the Computer Genetics Group 
Inc., Umversity Research Park, Madison, Wisconsin, United States of America' 
(Devereaux * al, Nucl. Acids Res. 12, 387-395, 1984). In detennining percentage 
10 Entity of nucleotide sequences using a program known in the art or described herein, 
it is preferable that default parameters are used. 

Alternatively, or in addition, a homologous or orthologous ERECTA or erecta allele 
will be associated with the transpiration efficiency of a plant and be linked to a protein- 

15 encoding region in its native plant genome that comprises a nucleotide sequence that 
encodes a polypeptide having at least about 55% overall sequence identity to a 
polypeptide encoded by a protein-encoding region linked to the ERECTA locus 
Preferably, the percentage identity at the amino acid level will be at least about 59-61% 
or 70% or 80%, more preferably at least about 90%, and still more preferably at least 

20 about 95%. 

In detennining whether or not two amino acid sequences fell within these percentage 
linuts, those skilled in the art will be aware that it is necessary to conduct a side-by-side 
comparison or multiple alignment of sequences. In such comparisons or alignments 
25 duTerences will arise in the positioning of non-identical residues, depending upon the 
algonthm used to perform the alignment, m the present context, reference to a 
percentage identity or similarity between two or more amino acid sequences shall be 
taken to refer to the number of identical and similar residues respectively, between said 
sequences as determined using any standard algorithm known to those skilled in the art 
30 For example, amino acid sequence identities or similarities may be calculated using the 
GAP program and/or aligned using the PILEUP program of the Computer Genetics 
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Group, Inc., University Research Park, Madison, Wisconsin, United States of America 
(Devereaux et al, 1984, supra). The GAP program utilizes the algorithm of Needleman 
and Wunsch, J. Mol Biol 48, 443-453, 1970, to maximize the number of 
identical/similar residues and to minimize the number and length of sequence gaps in 
5 the alignment. Alternatively or in addition, wherein more than two amino acid 
sequences are being compared, the ClustalW program of Thompson et al, Nucl Acids 
Res. 22, 4673-4680, 1994, is used. In determining percentage identity of amino acid 
sequences using a program known in the art or described herein, it is preferable that 
default parameters are used. 

10 

Alternatively, or in addition, a homologous or orthologous ERECTA or erecta allele 
will be associated with the transpiration efficiency of a plant and be linked to a protein- 
encoding region in its native plant genome that hybridizes to nucleic acid that 
comprises a sequence complementary to a protein-encoding region linked to an 
15 ERECTA locus, such as, for example, from A. thalicma, rice, sorghum, maize, wheat or 
rice. Preferably, such homologs or orthologs will be identified by hybridization under 
at least low stringency conditions, and more preferably under at least moderate 
stringency or high stringency hybridization conditions. 

20 For the purposes of defining the level of stringency, a low stringency is defined herein 
as being a hybridization or a wash carried out in 6xSSC buffer, 0.1% (w/v) SDS at 
28°C or alternatively, as exemplified herein. Generally, the stringency is increased by 
reducing the concentration of salt in the hybridization or wash buffer, such as, for 
example, by reducing the concentration of SSC. Alternatively, or in addition, the 

25 stringency is increased, by increasing the concentration of detergent (eg. SDS). 
Alternatively, or in addition, the stringency is increased, by increasing the temperature 
of the hybridization or wash. For example, a moderate stringency can be performed 
using 0.2xSSC to 2xSSC buffer, 0.1% (w/v) SDS, at a temperature of about 42°C to 
about 65°C. Similarly, a high stringency can be performed using O.lxSSC to 0.2xSSC 

30 buffer, 0.1% (w/v) SDS, at a temperature of at least 55°C. Conditions for performing 
nucleic acid hybridization reactions, and subsequent membrane washing, are well 
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understood by one normally skilled in the art. For the purposes of further clarification 
only, reference to the parameters affecting hybridization between nucleic acid 
molecules is found in Ausubel et al, In: Current Protocols in Molecular Biology, 
Greene/Wiley, New York USA, 1992, which is herein incorporated by reference. 

5 

A number of mapping methods for determining useful loci and estimating their effects 
have been described (eg. Edwards et al, Genetics 116, 113-125, 1987; Haley and 
Knott, Heredity 69, 315-324, 1992; Jiang and Zeng, Genetics 140, 1111-1127, 1995; 
Lander and Botstein, Genetics 121, 185-199, 1989; Jansen and Stam, Genetics 136, 

10 1447-1455, 1994; Utz and Melchinger, In: Biometrics in Plant Breeding: Applications 
of Molecular Markers. Proc. Ninth Meeting of the EUCARPIA Section Biometrics in 
Plant Breeding, 6-8 July 1994, Wageningen, The Netherlands, (J.W. van Ooijen and J. 
Jansen, eds), ppl95-204, 1994; Zeng, Genetics 136, 1457-1468, 1994). In the present 
context, these methods are applied to identify the major component(s) of the total 

15 genetic variance that contribute(s) to the variation in transpiration efficiency of a plant, 
such as, for example, determined by the measurement of carbon isotope discrimination 
(A). More particularly, the segregation of known markers is used to map and/or 
characterize an underlying locus associated with transpiration efficiency. The locus 
method involves searching for associations between the segregating molecular markers 

20 and transpiration efficiency in a segregating population of plants, to identify the linkage 
of the marker to the locus. 

To discover a marker/locus linkage, a segregating population is required. Experimental 
populations, such as, for example, an F2 generation, a backcross (BC) population, 

25 recombinant inbred lines (RIL), or double haploid line (DHL), can be used as a 
mapping population. Bulk segregant analysis, for the rapid detection of markers at 
specific genomic regions using segregating populations, is described by Michelmoore 
et al, Proa Natl Acad Set (USA) 88, 9828-9832, 1991. In the case of F2 mapping 
populations, F2 plants are used to determine genotype, and F2 families to determine 

30 phenotype. Recombinant inbred lines are produced by single-seed descent. 
Recombinant inbred lines, such as, for example, the F9 RILs of A. thaliana (eg. Lister 
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and Dean, Plant J., 4, 745-750, 1993) will be known to those skilled in the art. Near 
isogenic lines (NILs) are used for fine mapping, and to determine the effect of a 
particular locus on transpiration efficiency. An advantage of recombinant inbred lines 
and double haploid lines is that they are permanent populations, and as a consequence, 
5 provide for replication of the contribution of a particular locus to the transpiration 
efficiency phenotype. 

As for statistical methods, Single Marker Analysis (Point Analysis) is used to detect a 
locus in the vicinity of a single genetic marker. The mean transpiration efficiencies of a 

10 population of plants segregating for a particular marker, are compared according to the 
marker class. The difference between two mean transpiration efficiencies provides an 
estimate of the phenotypic effect of substituting one allele for another allele at the 
locus. To determine whether or not the inferred phenotypic effect is significantly 
different from zero, a simple statistical test, such as t-test or F-test, is used. A 

15 significant value indicates that a locus is located in the vicinity of the marker. Single 
point analysis does not require a complete molecular linkage map. The further the locus 
is from the marker, the less likely it is to be detected statistically, as a consequence of 
recombination between the marker and the gene. 

20 In the Anova, t-test or GLM approach, the association between marker genotype and 
transpiration efficiency phenotype comprises: 

(i) classifying progeny of a segregating population of plants by marker genotype, 
such as for example, using RFLP, AFLP, SSCP, or microsatellite analyses, 
thereby establishing classes of plants; 
25 (ii) comparing the mean transpiration efficiencies of classes of plants in the 
segregating population, using a t-test, GLM or ANOVA; and 
(iii) determining the significance of the differences in the mean at (ii), wherein a 
significant difference indicates that the marker is linked to the locus for 
transpiration efficiency. 
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As will be known to those skilled in the art, the difference between the means of the 
classes provides an estimate of the effect of the locus in determining the transpiration 
efficiency of a class. 

5 In the regression approach, the association between marker genotype and phenotype is 
determined by a process comprising: 

(i) assigning numeric codes to marker genotypes; and 

(ii) determining the regression value r for transpiration efficiency against the codes, 
wherein a significant value for r indicates that the marker is linked to the locus 

10 for transpiration efficiency, and wherein the regression slope gives an estimate 

of the effect of a particular locus on transpiration efficiency. 

For QTL interval mapping, the Mapmaker algorithm developed by Lincoln et al, 
Constructing genetic linkage maps with MAPMAKER/EXP version 3.0: A tutorial and 

15 reference manual. Whitehead Institute for Biomedical Research, Cambridge, MA, 
USA, 1993, can be used. The principle behind interval mapping is to test a model for 
the presence of a QTL at many positions between two mapped marker loci. This model 
is a fit of a presumptive QTL to transpiration efficiency, wherein the suitability of the 
fit is tested by determining the maximum likelihood that a QTL for transpiration 

20 efficiency lies between two segregating markers. For example, in the case of a QTL 
located between two segregating markers, the 2-loci marker genotypes of segregating 
progeny will each contain mixtures of QTL genotypes. Accordingly, it is possible to 
search for loci parameters that best approximate the distribution in transpiration 
efficiency for each marker class. Models are evaluated by computing the likelihood of 

25 the observed distributions with and without fitting a QTL effect. The map position of a 
QTL is determined as the maximum likelihood from the distribution of likelihood 
values (LOD scores: ratio of likelihood that the effect occurs by linkage: likelihood that 
the effect occurs by chance), calculated for each locus. 

30 Interval mapping by regression (Haley and Knott., Heredity 69, 315-324, 1992) is a 
simplification of the maximum likelihood method supra wherein basic QTL analysis or 
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regression on coded marker genotypes is performed, except that phenotypes are 
regressed on the probability of a QTL genotype as determined from the linkage 
between transpiration efficiency and the nearest flanking markers. In most cases, 
regression mapping gives estimates of QTL position and effect that are almost identical 
5 to those given by the maximum likelihood method. The approximation deviates only at 
places where there are large gaps, or many missing genotypes. 



In the composite interval mapping (CIM) method (Jansen and Stam, Genetics 136, 
1447-1455, 1994; Utz and Melchinger, 1994, supra, Zeng, Genetics 136, 1457-1468, 
10 1994), the analysis is performed in the usual way, except that the variance from other 
QTLs is accounted for by including partial regression giving more power and precision 
than simple interval mapping, because the effects of other QTls are not present as 
residual variance. CIM can remove the bias that can be caused by the QTLs that are 
linked to the position being tested. 

15 

Publicly available software are used to map a locus for transpiration efficiency. Such 
software include, for example, the following: 

(i) MapMaker/QTL (ftp://genome.wi.mit.edu/pub/mapmaker3/), for analyzing F2 
or backcross data using standard interval mapping; 
20 (ii) MQTL, for composite interval mapping in multiple environments or for 
performing simple interval mapping using homozygous progeny (eg. double 
haploids, or recombinant inbred lines); 

(iii) PLABQTL (Utz and/Melchinger, PLABlocus Version 1.0. A computer program 
to map QTL, Institut fur Pflanzenzuchtung, Saatgutforschung und 

25 Populationsgenetik, Universitat Hohenheim, 70593 Stuttgart, Germany, 1995; 

http://www.uni-hohenheim.deMpspwww/soft.html) for composite interval 
mapping and simple interval mapping of a locus in mapping populations 
derived from a bi-parental cross by selfing, or in double haploids; 

(iv) QTL Cartographer (http://statgen.mcsu.edu/qtlcart/cartographer.html) for single- 
30 marker regression, interval mapping, or composite interval mapping, using F2 

or backcross populations; 
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(v) MapQTL (http:/Avww.cpro.dlo.nl/cbw/); Qgene for performing either single- 
marker regression or interval regression to map loci; and 

(vi) SAS for detecting a locus by identifying associations between marker genotype 
and transpiration efficiency by a single marker analysis approach such as 

5 ANOVA, t-test, GLM or REG. 

In a particularly preferred embodiment, QTL cartographer or MQTL is used to identify 
a locus associated with the transpiration efficiency of plants. 

10 Those skilled in the art will also be aware that it is possible to detect multiple 
interacting alleles or genes for a particular trait, such as, for example, using composite 
interval mapping approaches. To achieve this end, the composite interval mapping 
may be repeated to look for additional loci. Alternatively, or in addition, two or more 
distinct regions of the genome can be nominated as candidate loci, and a gamete 

15 relationship matrix constructed for each candidate locus, and a 2-locus regression 
performed for each pair of loci, determining a best fit for the interacting effects 
between the two loci or aleles at those loci, including any dominance or additive 
effects. The algorithm described by Carlborg et aL, Genetics (2000) can be used for 
simultaneous mapping. In the present context, such an analysis is performed with 

20 reference to the segregation of transpiration efficiency phenotypes in the segregating 
population. 

Use of the ERECTA locus to enhance transpiration efficiency of plants 
As will be known to those skilled in the art, a single locus, if present in the genome of a 
25 plant, can have a significant influence on the phenotype of the plant. For example, 
Grandillo et al 9 Theor. Appl Genet 99, 978-987, 1999, showed that for tomato a 
selection made from a total 28 loci determining fruit size and weight explained 20% of 
the total phenotypic variance in this trait. 

30 Accordingly, a second aspect of the invention provides a method of selecting a plant 
having enhanced transpiration efficiency, comprising: 
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(a) identifying a locus associated with genetic variation in transpiration efficiency in a 
plant; and 

(b) selecting a plant that comprises or expresses a gene that maps to the locus. 

5 By "enhanced transpiration efficiency" is meant that the plant loses less water per unit 
of dry matter produced, or alternatively, produces an enhanced amount of dry matter 
per unit of water transpired, or alternatively, fixes an increased amount of carbon per 
unit water transpired, relative to a counterpart plant. By "counterpart plant" is meant a 
plant having a similar or near-identical genetic background, such as, for example, a 

10 near-isogenic plant, a sibling, or parent. 

In accordance with this aspect of the invention, a locus is identified by conventional 
locus mapping means, and/or by homology searching for genes that map to the 
ERECTA locus on chromosome 2 of the A. thaliana genome, such as, for example, by 
15 searching for ERECTA alleles or erecta alleles from a variety of plants, such as, for 
example, rice, wheat, sorghum, and maize, as described herein above. 

Preferably, to select a plant that comprises or expresses the appropriate gene, marker- 
assisted selection (MAS) is used. As will be known to those skilled in the art, once a 
20 particular locus has been identified, genetic or physical markers that are linked to the 
locus can be readily identified and used to confirm the presence of the locus in breeding 
populations. For a locus that is flanked by two tightly-linked markers that recombine 
only at a low frequency, the presence of the flanking markers is indicative of the 
presence of the locus. 

25 

For marker-assisted selection, it is preferred that the marker is a genetic marker (eg. a 
gene or allele), or a physical marker (eg. leaf hairiness or pod shape), or a molecular 
marker such as, for example, a restriction fragment length polymorphism (RFLP), a 
restriction (RAPD), amplified fragment length polymorphism (AFLP), or a short 
30 sequence repeat (SSR) such as a microsatellite, or SNP. It is also within the scope of 
the invention to utilize any hybridization probe or amplification primer comprising at 
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least about 10 nucleotides in length derived from a chromosome region that is linked in 
the genome of a plant to an ERECTA locus, as a marker to select plants. Those skilled 
in the art will readily be able to determine such probes or primers based upon the 
disclosure herein, particularly for those plant genomes which may have sufficient 
5 chromosome sequence in the region of interest in the genome (eg. A, thaliana, rice, 
cotton, barley, wheat, sorghum, maize, tomato, etc). 

For flanking markers that are not tightly linked, such that there is a large recombination 
distance there between, the presence of the appropriate gene is assessed by identifying 

10 those plants having both flanking markers and then selecting from those plants having 
an enhanced transpiration efficiency. Naturally, the greater the distance between two 
markers, the larger the population of plants required to identify a plant having both 
markers, the intervening locus and a gene within said locus. Those skilled in the art will 
readily be able to determine the population size required to identify a plant having a 

15 particular transpiration efficiency, based upon the recombination units (cM) between 
two markers. 

Transpiration efficiency is determined by any means known to the skilled artisan. 
Preferably, transpiration efficiency is determined by measuring dry matter 
20 accumulation in the plant by gravimetric means, or by measuring water loss, or the 
ratio of C0 2 assimilation rate to stomatal conductance. 

In a particularly preferred embodiment, the transpiration efficiency is determined 
directly, by measuring the ratio of carbon fixed (carbon assimilation rate) to water loss 
25 (transpiration rate). 

In an alternative embodiment, transpiration efficiency is determined indirectly from the 
carbon isotope discrimination value (A). Farquhar et aL, Aust. J. Plant Physiol 9,121- 
137, 1982, showed that carbon isotope discrimination (A; a measure of the extent to 
30 which the 13 C/ 12 C ratio of organic matter is less than that of C0 2 in the source air), is an 
effective indirect measure of transpiration efficiency. Discrimination, (A), is 
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approximately the isotope ratio of carbon in source C0 2 minus that of plant organic 
carbon. In a particular experiment, the source C0 2 is common to all genotypes. The 
determination of transpiration efficiency in this manner is based upon the constancy of 
the atmospheric 13 C: 12 C ratio (about 1.1:98.9) and the finding that ribulose 

5 bisphosphate carboxylase (Rubisco) enzymes discriminate against the use of 13 C. Thus, 
in C 3 plants 13 C0 2 is less efficiently assimilated than 12 C0 2 , and the 13 C0 2 left behind 
tends to diffuse back through stomata in and out of the leaf However, when the 
stomata become nearly closed, the relative back-diffusion of 13 C0 2 is more difficult to 
achieve and the relative intracellular concentrations of 13 C0 2 increases, thereby 

10 increasing the proportion of this isotope that is incorporated into 3-phosphoglycerate, 
and subsequently into dry matter. As a consequence, carbon isotope discrimination (A) 
is greatest when the overall C0 2 assimilation rate during photosynthesis (A) is small, 
and stomatal conductance (g w ) to water vapor is large. This relationship is represented 
by the following algorithm: 

15 A (%o )=27-36A/(g w x C a ) 

wherein C a is the ambient C0 2 concentration (ie. [ 12 C0 2 + 13 C0 2 ]). Discrimination, A , is 
approximately the isotope composition of source C0 2 minus that of plant organic 
carbon. 

20 For a C 3 plant that exhibits a value in the range of about 4.5 %o to about 6.7 %o for the 
term 36A/(g w x C a ), a 1 %o change in carbon isotope discrimination (A) corresponds to a 
change in transpiration efficiency in the range of about 22% to about 15%, respectively. 

The negative relationship between carbon isotope discrimination (A) and transpiration 
25 efficiency has been established for many C 3 plant species, including wheat (Farquhar 

and Richards, Aust J. Plant Physiol. 11, 539-552, 1984; Farquhar et al, Ann Rev. 

Plant Physiol. 40,388-397, 1989), Stylosanthes (Thumma et al t Proc. 9 th Aust. 

Agronomy Conf, Wagga Wagga New South Wales, Australia, 1998), cotton, barley, 

and rice. Accordingly, a lower carbon isotope discrimination (A) value for a test plant 
30 relative to a counterpart plant is indicative of enhanced transpiration efficiency. 
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In C4 species, like maize, coefficients in the equation above are different (Farquhar 
1983, Australian Journal of Plant Physiology, 10:205-226; Henderson et al.,1992, Aust. 
J. Plant Physiol. 19: 263-285) : 
A(%o)=l+5A/(g w xC a ). 
5 A l%o difference in A corresponds to about 38% difference in transpiration efficiency. 
The relationship between A and transpiration efficiency is positive. 13 C preferentially 
accumulates in bicarbonate, the substrate for PEP carboxylation, and so discrimination 
against 13 C is least when A is small and g w is large. However, as C0 2 leakineess from 
the budle sheath increases, C4 plants behave more like C3 plants. 

10 

Alternatively, or in addition, transpiration efficiency is determined by another indicator, 
such as, for example, leaf temperature, ash content, mineral content, or specific leaf 
weight (dry matter per unit leaf area). For example, specific leaf weight is positively 
15 correlated with transpiration efficiency in peanuts and other species (Virgona et al y 
Aust J. Plant Physiol, 17, 207-214, 1990; Wright et al, Crop Sci 34, 92-97, 1994). 
Accordingly, a higher specific leaf weight or higher carbon gain rate for a test plant 
relative to a counterpart plant is indicative of enhanced transpiration efficiency of the 
test plant. 

20 

The presence of the locus can be established by hybridizing a probe or primer that is 
linked to an ERECTA locus, such as, for example, a probe or primer that hybridizes to 
the identified chromosome 2 region of A. thaliana or the identified chromosome 6 
region of rice. 

25 

Preferably, the presence of the locus is established by hybridizing a probe or primer 
derived from any one or more of SEQ ID Nos: 1, 3, 5, 7, 9, 1 1 to 19 or 21 to 44 or from a 
homologous gene in another plant, or a complementary sequence to such a sequence, to 
genomic DNA from the plant, and detecting the hybridization using a detection means. 

30 

\ 
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In one embodiment, detection of the hybridization is performed preferably by labelling 
a probe with a reporter molecule capable of producing an identifiable signal, prior to 
hybridization, and then detecting the signal after hybridization. Preferred reporter 
molecules include radioactively-labelled nucleotide triphosphates and biotinylated 
5 molecules. Preferably, variants of the genes exemplified herein, including genomic 
equivalents, are isolated by hybridisation under moderate stringency or more 
preferably, under high stringency conditions, to the probe. 

Alternatively, or in addition, hybridization may be detected using any format of the 
10 polymerase chain reaction (PCR), including AFLP. For PCR, two non-complementary 
nucleic acid primer molecules comprising at least about 20 nucleotides in length, and 
more preferably at least 30 nucleotides in length are hybridized to different strands of a 
nucleic acid template molecule, and specific nucleic acid molecule copies of the 
template are amplified enzymatically. Several formats of PCR are described in 
15 McPherson et aL, In: PCR A Practical Approach, IRL Press, Oxford University Press, 
Oxford, United Kingdom, 1991, which is incorporated herein by reference. 

For enhancing the transpiration efficiency of a plant wherein the locus is polymorphic, 
such as, for example, an allele, the method supra is modified to include the detection of 
20 the specific allele(s) linked to the desired enhancement. According to this embodiment, 
there is provided a method of selecting a plant having enhanced transpiration 
efficiency, comprising: 

(d) identifying a locus associated with genetic variation in transpiration efficiency 
in a plant; 

25 (e) identifying a polymorphic marker within said locus that is linked to enhanced 
transpiration efficiency; and 
(f) selecting a plant that comprises or expresses the marker. 

Standard means known to the skilled artisan are used to identify a marker within the 
30 locus that is linked to enhanced transpiration efficiency. A population of plants that is 
segregating for the polymorphic marker is generally used, wherein the transpiration 
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efficiency phenotype of plants is then correlated or associated with the presence of a 
particular allelic form of the marker. As exemplified herein, near-isogenic or 
recombinant inbred lines of plants are screened to segregate alleles at the ERECTA 
locus and to correlate enhanced transpiration efficiency with the presence of the 
5 ERECTA allele as opposed to an erecta allele. Alternatively, mutations are introduced 
into an ERECTA allele such as, for example, by transposon mutagenesis, chemical 
mutagenesis or irradiation of plant material, and mutant lines of plants are established 
and screened to segregate alleles at the ERECTA locus that are correlated with the 
genetic variation in transpiration efficiency. 

10 

Suitable markers include any one or more of the markers described herein to be suitable 
for MAS. 

Preferably, the selection of plants in accordance with these embodiments includes the 
15 additional step of introducing the locus or polymorphic marker to a plant, such as, for 
example, by standard breeding approaches or by recombinant means. This may be 
carried out at the same time, or before, selecting the locus or polymorphic marker. 

Recombinant means generally include introducing a gene construct comprising the 
20 locus or marker into a plant cell, selecting transformed tissue and regenerating a whole 
plant from the transformed tissue explant. Means for introducing recombinant DNA 
into plant tissue or cells include, but are not limited to, transformation using CaCk and 
variations thereof, in particular the method described by Hanahan (1983), direct DNA 
uptake into protoplasts (Krens et al 9 Nature 296, 12-1 A, 1982; Paszkowski et aL, 
25 EMBO J. 3, 2717-2722, 1984), PEG-mediated uptake to protoplasts (Armstrong et aL, 
Plant Cell Rep. 9, 335-339, 1990) microparticle bombardment, electroporation (Fromm 
et al t Proc. Natl Acad Sci. (USA), 82, 5824-5828, 1985), microinjection of DNA 
(Crossway etaL 9 MoL Gen. Genet 202, 179-185, 1986), microparticle bombardment of 
tissue explants or cells (Christou et al, Plant Physiol. 87, 671-674, 1988; Sanford, Part. 
30 ScLTechnol. 5, 21-31, 1988), vacuum-infiltration of tissue with nucleic acid, or in the 
case of plants, T-DNA-mediated transfer from Agrobacterium to the plant tissue as 
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described essentially by An et al, EMBO J, 4, 277-284, 1985; Herrera-Estrella et al, 
Herrera-Estella et al, Nature 303, 209-213, 1983; Herrera-Estella et al, EMBO 7. 2, 
987-995, 1983; or Herrera-Estella et al, In: Plant Genetic Engineering, Cambridge 
University Press, N.Y., pp 63-93, 1985. 

5 

For microparticle bombardment of cells, a microparticle is propelled into a cell to 
produce a transformed cell. Any suitable ballistic cell transformation methodology and 
apparatus can be used in performing the present invention. Exemplary apparatus and 
procedures are disclosed by Stomp et al (U.S. Patent No. 5,122,466) and Sanford and 
10 Wolf (U.S. Patent No. 4,945,050). When using ballistic transformation procedures, the 
gene construct may incorporate a plasmid capable of replicating in the cell to be 
transformed. 

Examples of microparticles suitable for use in such systems include 1 to 5 micron gold 
15 spheres. The DNA construct may be deposited on the microparticle by any suitable 
technique, such as by precipitation. 

A whole plant may be regenerated from the transformed or transfected cell, in 
accordance with procedures well known in the art. Plant tissue capable of subsequent 

20 clonal propagation, whether by organogenesis or embryogenesis, may be transformed 
with a gene construct of the present invention and a whole plant regenerated therefrom. 
The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary 
tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, 

25 megagametophytes, callus tissue, existing meristematic tissue (eg., apical meristem, 
axillary buds, and root meristems), and induced meristem tissue (eg., cotyledon 
meristem and hypocotyl meristem). 

The term "organogenesis", as used herein, means a process by which shoots and roots 
30 are developed sequentially from meristematic centres. 
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The term "embryogenesis", as used herein, means a process by which shoots and roots 
develop together in a concerted fashion (not sequentially), whether from somatic cells 
or gametes. 

The generated transformed plants may be propagated by a variety of means, such as by 
clonal propagation or classical breeding techniques. For example, a first generation (or 
Tl) transformed plant may be selfed to give homozygous second generation (or T2) 
transformant, and the T2 plants further propagated through classical breeding 
techniques. 

The generated transformed organisms contemplated herein may take a variety of forms. 
For example, they may be chimeras of transformed cells and non-transformed cells; 
clonal transformants (eg., all cells transformed to contain the expression cassette); 
15 grafts of transformed and untransformed tissues (eg., in plants, a transformed root stock 
grafted to an untransformed scion ). 

Alternatively, the transformed plants are produced by an in planta transformation 
method using Agrobacterium tumefaciens, such as, for example, the method described 

20 by Bechtold et al, CRAcad. Set (Paris, Sciences de la vie/ Life Sciences) 316, 1194- 
1199, 1993 or Clough et al, Plant J 16: 735-74, 1998, wherein A. tumefaciens is 
applied to the outside of the developing flower bud and the binary vector DNA is then 
introduced to the developing microspore and/or macrospore and/or the developing seed, 
so as to produce a transformed seed. Those skilled in the art will be aware that the 

25 selection of tissue for use in such a procedure may vary, however it is preferable 
generally to use plant material at the zygote formation stage for in planta 
transformation procedures. 



30 



WO 2004/005555 



PCT/AU2003/000854 



52 



Identification of genes for determing the transpiration efficiency of a plant 
As exemplified herein, the inventors also identified specific genes or alleles that are 
linked to the ERECTA locus of A. thaliana, and rice and determine the transpiration 
efficiencies of those plants. More particularly, the transpiration efficiencies of near- 

5 isogenic lines, each carrying a mutation within an ERECTA locus, and a correlation 
between transpiration efficiency phenotype and ERECTA expression or gene copy 
number are determined, thereby providing the genetic contribution of genes or alleles at 
the ERECTA locus to transpiration efficiency. This analysis permits an assessment of 
the genetic contribution of particular alleles to transpiration efficiency, thereby 

10 determining allelic variants that are linked to a particular transpiration efficiency. 
Thus, the elucidation of the ERECTA locus for transpiration efficiency in plants 
facilitates the fine mapping and determination of allelic variants that modulate 
transpiration efficiency. The methods described herein can be applied to an assessment 
of the contribution of specific alleles to the transpiration efficiency phenotype for any 

15 plant species that is amenable to mutagenesis such as, for example, by transposon 
mutagenesis, irradiation, or chemical means known to the skilled artisan for mutating 
plants. 

Accordingly, a third aspect of the invention provides a method of identifying a gene 
20 that determines the transpiration efficiency of a plant comprising: 

(a) identifying a locus associated with genetic variation in transpiration efficiency 



(b) identifying a gene or allele that is linked to said locus, wherein said gene or 
allele is a candidate gene or allele for determining the transpiration efficiency of 



(c) determining the transpiration efficiencies of a panel of plants, wherein not all 
members of said panel comprise or express said gene or allele, and wherein 
variation in transpiration efficiency between the members of said panel 
indicates that said gene is involved in determining transpiration efficiency. 



in a plant; 



25 



a plant; and 
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In another embodiment, the method comprises: 

(a) identifying a locus associated with genetic variation in transpiration efficiency 
in a plant; 

(b) identifying multiple alleles of a gene that is linked to said locus, wherein said 
5 gene is a candidate gene involved for determining the transpiration efficiency of 

a plant; and 

(c) determining the transpiration efficiencies of a panel of plants, wherein each 
member of said panel comprises, and preferably expresses, at least one of said 
multiple alleles, wherein variation in transpiration efficiency between the 

10 members of said panel indicates that said gene is involved in determining 

transpiration efficiency. 

In the present context, the term "near isogenic plants" shall be taken to mean a 
population of plants having identity over a substantial proportion of their genomes, 

15 notwithstanding the presence of sufficiently few differences to permit the contribution 
of a distinct allele or gene to the transpiration efficiency of a plant to be determined by 
a comparison of the transpiration efficiency phenotypes of the population. As will be 
known to the skilled artisan, recombinant inbred lines, lines produced by introgression 
of a gene or transposon followed by several generations of backcrossing, or siblings, 

20 are suitable near-isogeriic lines for the present purpose. 

Preferably, the identified gene or allele identified by the method described in the 
preceding paragraph is selected from the group consisting of ERECTA allele, erecta 
allele, and homologs of ERECTA, wherein said homologs are from plants species other 
25 than A. thaliana. 

In a particularly preferred embodiment, the identified gene or allele will comprise a 
nucleotide sequence selected from the group consisting of: 

(d) a sequence having at least about 55% identity to a sequence selected from the 
30 group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 

7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
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NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 
SEQ ID NO: 19 SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ JD NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 
5 NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 

SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ K> 
NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44;; 

(e) a sequence encoding an amino acid sequence having at least about 55% identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 

10 SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 

12, SEQ ID NO: 20 and SEQ ID NO: 45; and 

(f) a sequence complementary to (a) or (b). 



Preferably, the percentage identity is at least about 59-61% or 70% or 80%, more 
15 preferably at least about 90%, and even more preferably at least about 95% or 99%. In 
a particularly preferred embodiment, the identified gene or allele comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 

20 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID 

NO: 17, SEQ ID NO: 18, SEQ ID NO: 19 SEQ ID NO: 21, SEQ ID NO: 22, 
SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID 
NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, 
SEQ ID NO: 32 SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID 

25 NO: 36, SEQ ID NO: 37, SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, 

SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; 

(b) a sequence encoding an amino acid sequence selected from the group consisting 
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 20 and SEQ ID NO: 45; and 

30 (c) a sequence complementary to (a) or (b). 
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Enhancement of transpiration efficiency using isolated genes 

The identified gene or alleles, including any homologs from a plant other than A. 
thaliana, such as, for example, the wild-type ERECTA allele, or a homolog thereof is 
useful for the production of novel plants. Such plants are produced, for example, using 
5 recombinant techniques, or traditional plant breeding approaches such as by 
introgression. 

Accordingly, a fourth aspect of the present invention provides a method of enhancing 
the transpiration efficiency of a plant comprising ectopically expressing in a plant an 
10 isolated ERECTA gene or an allelic variant thereof or the protein-encoding region 
thereof. 

Preferably, the ERECTA gene or allelic variant comprises a nucleotide sequence that is 
homologous to a protein-encoding region of a gene that is linked to the A. thaliana 
15 ERECTA locus on chromosome 2. 

In a particularly preferred embodiment, the isolated gene comprises a nucleotide 
sequence selected from the group consisting of: 

(a) a sequence having at least about 55% identity to a sequence selected from the 
20 group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 

7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 
SEQ ID NO: 19 SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
25 SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 

NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 
SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID 
NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; 

(b) a sequence encoding an amino acid sequence having at least about 55% identity 
30 to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 
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SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 20 and SEQ ID NO: 45; and 
(c) a sequence complementary to (a) or (b). 

5 Preferably, the percentage identity is at least about 59-61% or 70% or 80%, more 
preferably at least about 90%, and even more preferably at least about 95% or 99%. 

In a particularly preferred embodiment, the isolated gene or allele comprises a 

nucleotide sequence selected from the group consisting of: 
10 (a) a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 
12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID 
NO: 17, SEQ ID NO: 18, SEQ ID NO. 19 SEQ ID NO: 21, SEQ ID NO: 22, 
SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID 
15 NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, 

SEQ ID NO: 32 SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID 
NO. 36, SEQ ID NO: 37, SEQ ID NO: 38; SEQ ID NO. 39, SEQ ID NO. 40, 
SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; 

(b) a sequence encoding an amino acid sequence selected from the group consisting 
20 of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO. 6, SEQ ID NO: 8, SEQ ID NO: 

1 0, SEQ ID NO: 12, SEQ ID NO: 20 and SEQ ID NO: 45; and 

(c) a sequence complementary to (a) or (b). 

To ectopically express the isolated gene in a plant, the protein-encoding portion of the 
25 gene is generally placed in operable connection with a promoter sequence that is 
operable in the plant, which may be the endogenous promoter or alternatively, a 
heterologous promoter, and a transcription termination sequence, which also may be 
the endogenous or an heterologous sequence relative to the gene of interest. The 
promoter and protein-encoding portion and transcription termination sequence are 
30 generally provided in the form of a gene construct, to facilitate introduction and 
maintenance of the gene in a plant where it is to be ectopically expressed. Numerous 
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vectors suitable for introducing genes into plants have been described and are readily 
available. These may be adapted for expressing an isolated gene in a plant to enhance 
transpiration efficiency therein. 

5 Reference herein to a "promoter" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical eukaryotic genomic gene, including 
the TATA box which is required for accurate transcription initiation, with or without a 
CCAAT box sequence and additional regulatory elements (ie. upstream activating 
sequences, enhancers and silencers) which alter gene expression in response to 

10 developmental and/or external stimuli, or in a tissue-specific manner. In the present 
context, the term "promoter" is also used to describe a synthetic or fusion molecule, or 
derivative which confers, activates or enhances expression of said sense molecule in a 
cell. Preferred promoters may contain additional copies of one or more specific 
regulatory elements, to further enhance expression and/or to alter the spatial expression 

15 and/or temporal expression of a nucleic acid molecule to which it is operably 
connected. For example, copper-responsive regulatory elements may be placed 
adjacent to a heterologous promoter sequence driving expression of a nucleic acid 
molecule to confer copper inducible expression thereon. 

20 Placing a nucleic acid molecule under the regulatory control of a promoter sequence 
means positioning said molecule such that expression is controlled by the promoter 
sequence. A promoter is usually, but not necessarily, positioned upstream or 5' of the 
protein-encoding portion of the gene that it regulates. Furthermore, the regulatory 
elements comprising a promoter are usually positioned within 2 kb of the start site of 

25 transcription of the structural protein-encoding nucleotide sequences, or a chimeric 
gene comprising same. In the construction of heterologous promoter/structural gene 
combinations it is generally preferred to position the promoter at a distance from the 
gene transcription start site that is approximately the same as the distance between that 
promoter and the gene it controls in its natural setting, ie., the gene from which the 

30 promoter is derived. As is known in the art, some variation in this distance can be 
accommodated without loss of promoter function. Similarly, the preferred positioning 
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of a regulatory sequence element with respect to a heterologous gene to be placed 
under its control is defined by the positioning of the element in its natural setting, ie., 
the genes from which it is derived. Again, as is known in the art, some variation in this 
distance can also occur. 

5 

Promoters suitable for use in gene constructs of the present invention include those 
promoters derived from the genes of viruses, yeasts, moulds, bacteria, insects, birds, 
mammals and plants which are capable of functioning in plant cells, including 
monocotyledonous or dicotyledonous plants, or tissues or organs derived from such 
10 cells. The promoter may regulate gene expression constitutively, or differentially with 
respect to the tissue in which expression occurs or, with respect to the developmental 
stage at which expression occurs, or in response to external stimuli such as 
physiological stresses, pathogens, or metal ions, amongst others. 

15 Examples of promoters useful in performing this embodiment include the CaMV 35S 
promoter, rice actin promoter, rice actin promoter linked to rice actin intron (PAR-IAR) 
(McElroy et al f Mol and Gen Genetics, 231(1), 150-160, 1991), NOS promoter, 
octopine synthase (OCS) promoter, Arabidopsis thaliana SSU gene promoter, napin 
seed-specific promoter, PcSVMV, promoters capable of inducing expression under 

20 hydric stress, as described by, for example, Kasuga et al 9 Nature Biotechnology, 17, 
287-291, 1999), SCSV promoter, SCBV promoter, 35s promoter (Kay et al, Science 
236, 4805, 1987) and the like. In addition to the specific promoters identified herein, 
cellular promoters for so-called housekeeping genes, including the actin promoters, of 
promoters of histone-encoding genes, are useful. 

25 

The term "terminator" refers to a DNA sequence at the end of a transcriptional unit 
which signals termination of transcription. Terminators are 3 '-non-translated DNA 
sequences containing a polyadenylation signal, that facilitate the addition of a 
polyadenylate sequence to the 3'-end of a primary transcript. Terminators active in 
30 cells derived from viruses, yeasts, moulds, bacteria, insects, birds, mammals and plants 
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are known and described in the literature. They are isolatable from bacteria, fungi, 
viruses, animals and/or plants. 

Examples of terminators particularly suitable for use in the gene constructs of the 
5 present invention include the nopaline synthase (NOS) gene terminator of 
Agrobacterium tumefaciens, the terminator of the Cauliflower mosaic virus (CaMV) 
35S gene, the zein gene terminator from Zea mays, the Rubisco small subunit (SSU) 
gene terminator sequences and subclover stunt virus (SCSV) gene sequence 
terminators, amongst others. 

10 

Those skilled in the art will be aware of additional promoter sequences and terminator 
sequences that may be suitable for use in performing the invention. Such sequences 
may readily be used without any undue experimentation. 

15 Preferably, the gene construct further comprises an origin of replication sequence for its 
replication in a specific cell type, for example a bacterial cell, when said gene construct 
is required to be maintained as an episomal genetic element (eg. plasmid or cosmid 
molecule) in said cell. Preferred origins of replication include, but are not limited to, 
the /7-ori and colEl origins of replication. 

20 

Preferably, the gene construct further comprises a selectable marker gene or genes that 
are functional in a cell into which said gene construct is introduced. 

As used herein, the term "selectable marker gene" includes any gene which confers a 
25 phenotype on a cell in which it is expressed to facilitate the identification and/or 
selection of cells which are transfected or transformed with a gene construct of the 
invention or a derivative thereof. 

Suitable selectable marker genes contemplated herein include the ampicillin resistance 
30 (Amp 1 ), tetracyclin-resistance gene (Tc 1 ), bacterial kanamycin resistance gene (Kan 1 ), 
phosphinothricin resistance gene, neomycin phosphotransferase gene (npiK), 
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hygromycin resistance gene, gentamycin resistance gene (gent), (3-glucuronidase (GUS) 
gene, chloramphenicol acetyltransferase (CAT) gene, and luciferase gene, Green 
Fluorescent Protein gene (EGFP and variants), amongst others. 

5 In a related embodiment, the invention extends to the use of an isolated gene 
comprising a nucleotide sequence that is homologous to a protein-encoding region of a 
gene of A. thaliana that is positioned between about 46cM to about 50.74cM on 
chromosome 2 in the preparation of a gene construct for enhancing the transpiration 
efficiency of a plant. 

10 

In an alternative embodiment of the invention, the transpiration efficiency of a plant is 
enhanced by classical breeding approaches, comprising introgressing the isolated gene 
into a plant. For introgression of a gene, the gene is transferred from its native genetic 
background into another genetic background using standard breeding, for example, a 

15 gene that enhances transpiration efficiency in a progenitor such as a diploid cotton or 
diploid wheat may be transferred into a commercial tetraploid cotton or hexaploid 
wheat, respectively, by standard crossing, followed by several generations of back- 
crossing to remove the genetic background of the progenitor. Naturally, continued 
selection of the gene of interest is required, such as, for example, facilitated by the use 

20 of markers. 

A further aspect of the present invention provides a plant having enhanced transpiration 
efficiency, wherein said plant is produced by a method described herein. 

25 Clearly the ERECT A genes, allelic variants and protein coding regions described 
herein are useful in determining other proteins that are involved in the transpiration 
process in plants. For example, an ERECTA gene, allelic variant thereof or protein 
coding region thereof may be used in a forward 'n' -hybrid assay to determine if said 
peptide is able to bind to a protein or peptide of interest. Forward 'n' hybrid methods 

30 are well known in the art, and are described for example, by Vidal and Legrain Nucl 
Acid Res. 27(4), 919-929 (1999) and references therein, and include yeast two- hybrid, 
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bacterial two-hybrid, mammalian two-hybrid, PolHI (two) hybrid, the Tribrid system, 
the ubiquitin based split protein sensor system and the SOS recruitment system. Such 
methods are incorporated herein by reference 

5 In adapting a standard forward two-hybrid assay to the present invention, an ERECTA 
protein is expressed as a fusion protein with a DNA binding domain from, for example, 
the yeast GAL4 protein. Methods of constructing expression constructs for the 
expression of such fusion proteins are well known in the art, and are described, for 
example, in Sambrook et al (In: Molecular Cloning: A laboratory Manual, Cold Spring 

10 Harbour, New York, Second Edition, 1989). A second fusion protein is also expressed 
in the yeast all, said fusion protein comprising, for example, a protein thought to 
interact with an ERECTA protein, for example the GAM activation domain. These 
two constructs are then expressed in a yeast cell in which, a reporter molecule (e.g., tef , 
Amp\ Rif, bsdf, zeof, Kan r t gfp t cobA, LacZ, TRPl LYS2 t HIS3, HIS5, LEU2, URA3, 

15 ADE2, MET13, METIS) under the control of a minimal promoter placed in operable 
connection with a GAL 4 binding site. If the proteins do not interact, a reporter 
molecule is not expressed. However, if said proteins do interact, said reporter molecule 
is expressed. Accordingly a protein, polypeptide, peptide that is able to specifically 
bind a target protein is identified. 

20 

A forward 'n' -hybrid method may be modified to facilitate high throughput screening 
of a library of peptides, polypeptides and/or proteins in order to determine those that 
interact with an ERECTA protein. Methods of screening libraries of proteins are well 
known in the art and are described, for example, in Scopes Qm Protein Purification: 
25 Principles and Practice, Third Edition, Springer Verlag, 1994). Proteins identified by 
this method are potentially involved in the transpiration process in plants. 

The present invention is further described with reference to the following non-limiting 
examples. 

30 
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EXAMPLE 1 

12 C/ 13 C discrimination as a marker for screening genetic variation 
in transpiration efficiency. 
Experimental conditions and sampling procedures were established to allow the control 
5 of many factors, other than genetic, that influence transpiration efficiency at the level of 
individual leaves and plants. These factors fall into several categories: (a) 
characteristics of the seedling's micro-environment: temperature, light, humidity, 
boundary layer around the leaves, root growth conditions; (b) developmental and 
morphological effects that modify gas exchange and C metabolism and therefore 
10 carbon isotopic signature (eg age, stage, posture); and (c) seed effects. 

We developed high resolution mass-spectrometer techniques for measuring C isotope 
ratios in whole tissues or carbon compounds such as soluble sugars -ie a measure of 
integrated transpiration efficiency over the plant's life or over a day, respectively, and 
15 also for measuring instantaneous transpiration efficiency during gas exchange. 

This means: 

0.1 per mil analytical precision in the measurement of the isotopic composition 
of leaf carbon. Discrimination, (A), is approximately the isotope ratio of carbon 
20 in source C0 2 minus that of plant organic carbon. In a particular experiment, the 

source CO2 is common to all genotypes. 

0.1 per mil biological precision, that is variation between replicated seedlings, 
grown in soil, either in growth chambers or in glasshouses with CO2, humidity 
and temperature control (corresponding to approximately 1.5% variation in 

25 transpiration efficiency). 

The ability to grow and screen large batches of seedlings in glasshouses or 
growth chambers (up to 1500), under standardised leaf and root growth 
conditions, to a rosette size of several cm within 2-3 weeks allowing individual 
measurements, on the same plant, of isotope ratios and also of the underlying 

30 properties (eg in situ measurement of leaf temperature by infra-red thermometry 
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as a measure of stomatal conductance; chlorophyll fluorescence; leaf 
expansion). 

EXAMPLE 2 

5 Natural genetic variation in transpiration efficiency in Arabidopsis thaliana 

A. thaliana ecotypes were screened for leaf A under glasshouse conditions. There was 
a large spread of values (corresponding to approximately 30 % genetic variation in 
transpiration efficiency). However, large environmental effects were noted. A few 
contrasted ecotypes were selected at the two extremes of the range of A values and 

10 compared under various conditions of irradiance (150 to 500 pE m'V 1 ), light spectrum 
(Red/Far-Red ratios) and air humidity (60 to 90%) while roots were always well 
watered. The magnitude of genetic differences in transpiration efficiency was very 
much influenced by environmental conditions. This was in part due to variations among 
ecotypes in the dependence of photosynthesis on light and vapour pressure deficit. 

15 Genetic differences were maximal under a combination of high light and low humidity, 
in growth chambers. 

The ecotypes Columbia (Col) and Landsberg erecta (Ld-er) have extreme carbon 
isotope discrimination values, with Col always having smaller A values than Ld-er ie 
20 less negative 6 13 C isotopic compositions, and thus a greater transpiration efficiency. 

EXAMPLE 3 

Identification of a locus associated with transpiration efficiency in A. thaliana 

25 Quantitative Trait Loci (QTL) analysis of the Lister and Dean's (1993) Recombinant 
Inbred Lines (later referred to as RILs) was performed to identify and map a locus 
associated with carbon isotope discrimination (A). The RELs were from a cross 
between Col-4 and Ler-0. Our analysis showed the importance of genes around the ER 
locus on chr2, and a role for genes other than ERECTA in conferring transpiration 

30 efficiency on A. thaliana 
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More particularly, 300 RI mapping lines between Col and Ler ecotypes, available at the 
Arabidopsis Stock Centre, were generated from a cross between the Arabidopsis 
ecotypes Columbia (Col4) and Landsberg erecta (Ler-0 carrying erJ) (Lister and Dean, 
1993), using Columbia as the male parent. A subset of 100 of these lines, chosen as the 
5 most densely and reliably mapped were used in the present analysis. 

The seeds were multiplied in a glasshouse in an attempt to minimize confounding seed 
effects in our comparisons. Large numbers of seeds were obtained for most lines except 
for a few, including Col4 parent, which had to be re-ordered following low seed 
10 viability of the original sample sent by the Stock Centre. The seeds harvested in these 
propagation runs were used throughout all our experiments to date. 

Loci were analysed using two programs, QTL cartographer and MQTL. These 
programs compute statistics of a trait at each marker position, using a range of methods 

15 [linear regression (LR), stepwise regression (SR), and likelihood approaches (Single 
interval mapping (SIM) which treats values at individual markers as independent 
values, and composite interval mapping (C1M) which allows for interactions between 
markers and associated locus)]. By nature each of these methods has some biases and 
embedded assumptions, hence the importance of analysing data with more than one 

20 program. Only results that were consistent between the two programs, and robust to 
additions or deletions to the set of background markers used for composite interval 
mapping are reported below. 

Initial QTL analysis was done in parallel to seed multiplication on a subset of 40 lines 
25 for which enough seeds were sent. Once all seeds had been multiplied this was repeated 
on the full set of 100 lines. These two analyses indicated the existence of a locus for 
carbon isotope discrimination (A), that maps to the region including the ERECTA locus 
on chromosome 2, at approximately 46-51 cM (Table 1, run 1&2). 

30 Given the complexity and integrative nature of A as a physiological trait, such a small 
number of loci associated with the trait was not expected. Subsequent experiments 
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were therefore designed to test these results and assess their stability across the range of 
environmental conditions known for their effects on gene expression related to A (see 
above). QTL analysis was repeated on several completely independent data sets 
obtained under highly controlled conditions in glasshouses or growth chambers, where 
5 either air humidity, photoperiod or irradiance (amount, diurnal pattern, day to day 
variation) was varied. Depending on the experiment, all 100 recombinants inbred lines 
were included or only the subset of lines with cross-overs on chromosome 2. These 
experiments confirmed that genetic variation in A could be mostly ascribed to a portion 
of chromosome 2 (Table 1) between about 46-50.7 cM. 

10 

When RILs were sorted graphically according to carbon isotope discrimination and 
their genotype at the ER marker (50.64 cM) and its vicinity (Ld-erJ genotype or Col- 
ER genotype), lines which were Ld-er at the ERECTA marker ranked mostly at the high 
end of carbon isotope discrimination values. In contrast, lines having a Col-ERECTA 

15 marker genotype ranked mostly at the low end of carbon isotope discrimination values 
(data available on request). In the middle of the range of carbon isotope discrimination 
values, there was some overlap between the two sets of lines. Some lines were always 
at an extreme (in all 18 experiments performed), while the ranking of other lines was 
more unstable. These data indicate a locus for transpiration efficiency, as determined 

20 by the carbon isotope discrimination value, in the vicinity of the ERECTA locus on 
chromosome 2 (Table 1). This locus most likely involves the ER gene. Depending on 
the positions of cross-overs between Ld-er and Col, recombination between ERECTA 
and one or more of the other genes influences the transpiration efficiency phenotype of 
the progeny. 

25 

EXAMPLE 4 
Transformation protocol for maize 

Gun transformation 

30 A suitable method for maize transformation is based on the use of a particle gun 
identical to that described by J. Finer (1992, Plant Cell Report, 77:323-328). The target 
cells are fast dividing undifferentiated cells having maintained a capacity to regenerate 
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in whole plants. This type of cells composes the embryogenic callus (called type IT) of 
maize. These calluses are obtained from immature embryos of genotype Hill according 
to the method and on medium described by Armstrong (Maize Handbook ; 1994 M. 
Freeling, V. WalbotEds ; pp.665-671). 

5 

These fragments of the calluses having a surface from 10 to 20 mm2 are arranged, 4 
hour before bombardment, by putting 16 fragments by dish in the center of a Petri dish 
containing an culture medium identical to the medium of initiation of calluses, 
supplemented with 0.2 M of mannitol + 0,2 M of sorbitol. Plasmids containing the 
10 ERECTA sequences to be introduced, are purified on QiagenR column following the 
instructions of the manufacturer. 

They are then precipitated on particles of tungsten (M10) following the protocol 
described by Klein et al t Nature, 327, 70-73, (1987). Particles so coated are sent 
15 towards the target cells by means of the gun according to the protocol described by 
Finer et al t Plant Cell Report, 77:323-328, 1992. The bombarded dishes of calluses are 
then sealed by means of ScellofraisR then cultivated in the dark at 27°C. 

The first transplanting takes place 24 hours later, then every other week during 3 
20 months on medium identical to the medium of initiation supplemented with a selective 
agent. After 3 months or sometimes earlier, one can obtain calluses the growth of 
which is not inhibited by the selective agent, usually and mainly consisting of cells 
resulting from the division of a cell having integrated into its genetic patrimony one or 
several copies of the gene of selection. The frequency of obtaining of such calluses is 
25 about 0,8 callus by bombarded dish. 

These calluses are. identified, individualized, amplified then cultivated so as to 
regenerate seedlings, by modifying the hormonal and osmotic equilibrium of the cells 
according to the method described by Vain and al. (1989, Plant Cell tissue and organ 
30 Culture 18:143-151). These plants are then acclimatized in greenhouse where they can 
be crossed for obtaining hybrids or self-fertilized. 
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In a preferential way, one can use a similar protocol, the principle of which is described 
in Methods of Molecular Biology: Plant gene transfer and expression protocols (1995, 
vol 49, PP1 13-123), and in which the immature embryos of genotype Hill are directly 
5 bombarded with golden particles coated with plasmides ERECTA to introduce, 
prepared according to the protocol described by Barcelo and Lazzeri (1995, Methods of 
Molecular Biology 49: 1 13-123). 

Steps of transformation, selection of the events, maturation and regeneration are similar 
10 to those described in the previous protocol. 

Agrobacterium transformation 

Another technique of useful transformation within the framework of the invention uses 
Agrobacterium tumefaciens, according to the protocol described by Ishida and al (1996, 
15 Nature Biotechnology 14 : 754-750), in particular starting from immature embryos 
taken 10 days after fertilization. 

All the used media are referenced in the quoted reference. The transformation begins 
with a phase of co-culture where the immature embryos of the maize plants are put in 
contact during at least 5 minutes with Agrobacterium tumefaciens LBA 4404 
20 containing the superbinary vectors. 

The superbinary plasmid is the result of an homologous recombination between an 
intermediate vector carrying the T-DNA, and containing the gene of interest and/or the 
marker gene of selection, and the vector pSBl of Japan Tobacco (EP 672 752) 
25 containing: the virB and virG genes of the plasmide pTiBo542 present in the 
supervirulent strain A281 of Agrobacterium tumefaciens (ATCC 37349) and an 
homologous region found in the intermediate vector, allowing homologous 
recombination. 

30 Embryos are then placed on LSAs medium for 3 days in the dark and at 25°C. A first 
selection is made on the transformed calluses: embryogenic calluses are transferred on 
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LSD5 medium containing phosphinotricine (5 mg / 1) and cefotaxime (250 mg / 1) 
(elimination or limitation of contamination by Agrobacteriian tumefaciens). 

This step is performed during 2 weeks in the dark and at 25°C. The second step of 
5 selection is realized by transfer of the embryos which developed on LSD5 medium, on 
LSD10 medium (phosphinotricine, 10 mg / 1) in the presence of cefotaxime, during 3 
weeks at the same conditions as previously. The third stage of selection consists in 
excising the calluses of type I (fragments from 1 to 2 mm) and in transferring them for 
3 weeks in the darkness and at 25°C on LSD 10 medium in the presence of cefotaxime. 
10 The regeneration of seedlings is made by excising the calluses of type I which 
proliferated and by transferring them on LSZ medium in the presence of 
phosphinotricine (5 mg / 1) and of cefotaxime for 2 weeks at 22°C and under 
continuous light. 

15 Seedlings having regenerated are transferred on RM medium + G2 containing 
Augmentin (lOOmg / 1) for 2 weeks at 22°C and under continuous illumination for the 
development step. The obtained plants are then transferred to the phytotron with the 
aim of acclimatizing. 

20 EXAMPLE 5 

Detecting expression of ERECT A protein 

Extraction of ERECT A from leaves and seeds of maize. 

Leaves are harvested and immediately frozen in liquid nitrogen. Grinding is made in a 
25 mortar cleaned in ethanol 100 % and cooled on ice. A foliar disc of 18 mm diameter is 
extracted in 200 nL of extraction buffer: Tris-HCl pH 8.0, glycerol 20%, MgC12 10 
mM, EDTA 1 mM, DTT 1 mM, PVP insoluble 2% (p/v), Fontainebleau sand et 
protease inhibitors: leupeptin 2mg/L, chymostatin 2mg/L, PMSF ImM and E64 lmg/L. 
The ground material is then centrifuged in 4°C during 15 minutes at 20000g to 
30 eliminate fragments. 
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Grains are first reduced to powder in a bead-crusher (Retsch). Proteins are extracted by 
suspending IOOjiL of powder in 400 \iL of the previously described buffer on ice. This 
mixture is vortexed and centrifiiged at 4°C during 15 minutes et 20000g to eliminate 
fragments. 

ERECTA protein levels are then measured using techniques known to those skilled in 
the art, and described, for example, in Scopes (In: Protein purification: principles and 
practice, Third Edition, Springer Verlag, 1994). 



10 EXAMPLE 6 

Determination of a role for the ERECTA gene in regulating transpiration efficiency 

• «. 

We compared Col and Ler ecotypes with near-isogenic mutant lines for the erecta 
gene, to examine a possible role of the ERECTA gene in determining carbon isotope 
15 discrimination (A). 

Plants expressing the wild type ERECTA gene (SEQ ID NO: 1), or an erecta mutant 
allele in the Columbia background (eg. Col-erl, Col-er2, CoUerlOl to -er!05; or Col- 
erl06 to -erl23) and in Landsberg background (Ld-erl) have been publicly described. 
20 Three of these mutants were available for comparison to the isogenic or near-isogenic 
lines (Table 2). 

Col4 (ER) and Ld-er7, the parental lines for Lister and Dean's RILs were 
systematically included in the comparison. Where possible, other Col "ecotypes" were 
25 also included, (eg. ColO, Coll, Col3-7), to assess their similarity with respect to carbon 
isotope discrimination, especially compared to the RIL parental ecotype Col4. 

The results of these comparisons are described in Table 3. Data indicate the differences 
in carbon isotope discrimination values between er and ER lines for 15 different 
30 experimental runs corresponding to growth under low to high light (100 to 800 fiE m" 2 
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s" 1 ), low to high humidity (40 to 85%), short to long days (8, 10, 24hrs), normal to high 
temperatures (22/20°C to 28/20 °C). 

As expected, the spread of carbon isotope discrimination values among lines varied 
5 with environmental conditions. Lines carrying er mutations have a greater carbon 
isotope discrimination value overall than those having the ER wild type gene (see Table 
3, column 1), indicative of a lower water use-efficiency. There is usually little 
difference in C isotopic discrimination between the various Col lines, (see the similar 
averages obtained for columns 2, 3, and 4 in Table 3, wherein erl05 is compared to 3 

10 different Col ecotypes, ColO, Col4 and 3176 or Coll). When present, the erl05 mutant 
always has the greatest carbon isotope discrimination value of all lines, including erl 
and er2 (columns 2-4 compared to columns 5-6 in Table 3, or column 8 compared to 
column 9 in Table 3). The value measured in the erlOS mutant is always significantly 
greater than in the ER isogenic line (column 4 in Table 3). The value measured in erl 

15 (Landsberg parental line NW20) is usually also greater than that in the ER lines 3177 
(near isogenic, column 6 of Table 3), and to a lesser extent Col4 (Columbia parental 
line, column 7 of Table 3). These observations give direct evidence that the ERECTA 
gene plays a significant role in determining genetic differences in carbon isotopic 
discrimination in Arabidopsis. 

20 

This conclusion is independently confirmed by leaf gas exchange measurements that 
allow the direct measure of transpiration efficiency (ratio of net CO2 fixation to water 
loss; column 4 in Table 4; Figures la-lc, 2a-2c). Measurements on mature leaves 
reveal that ER lines are characterised by a greater ratio of CO2 assimilation to water 

25 loss than lines carrying er mutations. This is most obvious when comparing the pair 
Coll/erl05 with a 21% greater transpiration efficiency (ratio A/E) in Coll than erlOS, 
or the pair Coll/er2 with a 16% greater transpiration efficiency in Coll. Consistent 
with the measurements of carbon isotope discrimination, the effect er/ER is relatively 
smaller in the Ld background (9% greater ratio A/E in Ld-ER (3 177) than in the Ld-erl 

30 (NSW20) line). 
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Also consistent with the carbon discrimination measurements, is the 20% difference in 
transpiration efficiency between the two RILs parental lines (4.06 and 3.38 
mmolC/molH20 in Col4-ER and Ld-erl, respectively). 

5 The fact that of all 3 erecta mutants examined, erlOS has the most extreme carbon 
discrimination and transpiration efficiency phenotypes suggests that the erlOS mutation 
affects a more crucial part of the ERECTA gene than er2 or erl. This is consistent with 
the published data on the erlOS mutant. This mutation corresponds to the insertion of a 
large "foreign insert" in the ERECTA gene. The insertion inhibits transcription of the 

10 gene and causes the strongest erecta phenotype of all erecta mutants isolated in Col 
(with respect to inflorescence clustering and silique width and shape). Alternatively, or 
in addition, data indicate that erecta mutations have a stronger effect on carbon isotope 
discrimination values in a Columbia genetic background than in a Landsberg 
background (comparison of phenotypic effects of erl 05 and erl\ implying that other 

15 genes, polymorphic between Landsberg and Columbia ecotypes, interact with ERECTA 
in determining transpiration efficiency. This could also account for the greater 
difference in transpiration efficiency between er/ER lines in Col background than in a 
Ld background (see above, Table 4). Alternatively, or in addition, data indicate that the 
erecta mutation is not the only mutation present in the erlOS mutant. For example, the 

20 mutagenized Col seeds may have carried the gll mutation, induced by the fast neutron 
irradiation, that also contributes to the phenotype observed. 

A comparison of transcript profiles in er/ER isogenic lines (in both Col and Ld 
background) allows determination of the involvement of additional genes to ERECTA 
25 and the effect of environment on their expression. 



30 
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EXAMPLE 7 

QTL detection centred on iheERECTA marker and ERECT A gene locus on 
chromosome 2 of Arabidopsis thaliana 

5 1. Methods 

Numerous runs using the Lister and Dean (1993) Recombinant Inbred Lines between 
Col-4 and Ler-0 were grown in a temperature controlled glass house (20/20°C) or 
within growth cabinets (21°C and light levels ranging from 100 to 500 pE in 2 s" 1 
irradiance, and 50-70% relative humidity). Runs included a variety of all 100 RDLs as 

10 well as subsets of these 100 along with parental Col-4 and Ler-0 (NW20) parental lines. 
Individual RILs were replicated within runs. Seeds were either cold-treated on moist 
filter paper for 2-4 days, cold-treated and planted directly onto soil; or plated onto agar, 
cold-treated for 2-4 days, grown on agar for about 11-15 days before being transferred 
to soil. Plants were well-watered, and grown for 4-5 weeks before harvest. Samples 

15 (whole or part of rosette) were collected and dried in an 80°C oven before being ground 
and analysed for C isotopic composition. The value used for QTL analysis for an 
individual line was the average of the replicated plants of that line within one run. 

2. Marker Selection 
20 The standard set of 64 markers for the Lister and Dean recombinant lines were down- 
loaded from the NASC website. Additional markers were added to this data set when 
significance was first determined to get finer scale mapping in the regions of interest. A 
total of 121 markers were used across the 5 chromosomes. 

25 3. Analysis 

Rims were analysed using Simple Interval Mapping (SIM)(Lander & Botstein 1989) 
and Composite Interval Mapping (CIM) (Zeng 1993 & 1994). Two programs were used 
to analyses the data, QTL Cartographer version 1.14 (Basten et al. 1999) and MQTL 
version 0.98 (Tinker and Mathers 1995). The two programs differ in how they deal 
30 with background markers for Composite Interval Mapping (CIM). Within MQTL the 
background markers are chosen at random and put into the map input file. Within QTL 



WO 2004/005555 




PCT/AU2003/000854 



73 

Cartographer the background markers are not chosen at random but rather are chosen 
from the Stepwise Regression analysis selecting the "best" background markers. The 
setting or choosing of these markers also has an influence on the level of statistical 
significance. Tinker argues that it is not possible to find an appropriate threshold for 
5 statistical error control when background markers are selected based on the data. Hence 
we used the two programs and have concentrated on QTLs that were present in both 
sets of analysis. 

a) OTL cartographer 

10 Qstat was used to determine whether the data had a normal distribution (if not then 
measures were taken to fix the distribution). Linear Regression (LR) and Stepwise 
Regression (SR) were performed using the default settings (Stepwise regression used 
forward with backward elimination) 5% significance. Simple Interval and CIM were 
performed using the Zmap.qtl function. The data were analysed across all 

15 chromosomes with a walking speed of 2 cM. For model 6 (CIM) the number of 
background parameters was left at the default of 5 along with the window size which 
was left on the default of 10 cM. One thousand permutations were performed within 
CIM (Churchill and Doerge 1994). Eqtl was then run to determine the significant 
QTLs. 

20 

b^MOTL 

The same set of markers used in QTL Cartographer was used in MQTL. Background 
markers were chosen at random for CIM. The number of markers chosen was 
approximately half that of the number of RJDLs used in the set. The default setting of a 
25 walking speed of 5 cM was selected, 3000 permutations were performed to determine 
significance levels with type 1 error set at 5% . 

QTLs that were present in both programs and from varied background marker sets from 
within MQTL were considered genuine. This, coupled with repeated QTL analysis 
30 across independent experiments, lead to a significant repeatable locus surrounding the 
ERECTA gene on Chromosome 2 (Table 5). 
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Data in Table 5 indicate that there is a major QTL with a LOD score significant at the 
probability level of 5% and, for most runs, of even 1%, on Arabidopsis thaliana 
chromosome 2. In all cases, that interval sits above the ER marker on chromosome 2. 
5 Depending on the experimental run, this QTL explains 18 to 64% (see column R 2 ) of 
the total genetic variance in transpiration efficiency. 

Data in Figure 3 indicate a positive additive effect of the identified QTL based upon the 
mean value of the carbon isotope composition in plants carrying the Col-4 ERECTA 
10 allele. 

EXAMPLE 8 

Complementation Test: transformation of A. thaliana lines carrying erecta mutations 
with the wild type ERECTA gene under the control of the 35S promoter. 

15 

L Methods 

Two Columbia erecta lines were transformed using a binary vector generously given 
by Dr Keiko Torii. That plasmid was constructed using the vector plasmid pPZP222 
(see details on this vector in Hajdukiewics et al Plant Mol Biol 25, 989-994, 1994). 

20 The pPZP vectors carry chimeric genes in a CaMV 35S expression cassette that confer 
resistance to kanamycin or gentamycin in plants. The plant selectable marker 
(gentamycin resistance gene for the pPZP222 vector) is cloned next to the LB. Cloning 
sites for the gene of interest (ER in our case) is between the plant marker and the KB 
sequences. This ensures that that gene is transferred to plant first, followed by the gent 

25 gene. Resistance to gentamycin will therefore be obtained only if the ER gene is also 
present. 

The binary vector was transferred to disarmed strain AGL1 of Agrobacterium 
tumefaciens by standard tri-parental matings (Ditta et al, 1980, PNAS 77,7347-7351) 
30 using the pRK2013 helper strain of E coli. 
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Arabidopsis plants were transformed using the standard floral dip method for 
transformation by disarmed strains of A tumefaciens (Clough and Bent, 1998, The 
Plant Journal 16, 735-743). 

5 Two Columbia erecta lines were transformed, for which we had numerous data 
showing consistently more negative isotopic values in those lines (ie lower 
transpiration efficiency) than in near-isogenic Col ER wild type plants. These two lines 
were as follows: 

1. er 105, a knock-out mutant due to the insertion of a large piece of DNA in the 
10 ERECTA gene and 

2. line Col-er2 (3401 NASC identifier), same as erl06 (Lease et al2001). 

Seedlings were screened on MS plates on 100 ^tg/ml gentamycin sulfate. Putative 
transformants were transferred to soil and their progeny screened again for gentamycin 
15 resistance, for confirmation and identification of homozygous lines and T3 seed 
collection. 

Many independent transformant lines were obtained and among those were several ER 
homozygous lines, which were selected for subsequent analysis (see Table 6). 

20 

A stable transgenic homozygous Landsberg ER line also obtained by transforming the 
Ld-erl ecotype (NW20) with the same construct as described above was given to us by 
Dr Keiko Torii (line T3-7K in Table 6 or ,, T2+ER n in Figures 6-9). 

25 2. Results: 

Initial analysis of several ER transformants in the Col-erl05, Col-erl06/er2, and Ld-erl 
background (as shown in Table 6 above): 

Effective transformation was ascertained and ER expression levels were quantified in 
30 several independent T2 transformants using real-time quantitative PCR (ABI PRISM 
7700, Sequence Detection System User Bulletin #2. 1997). Basically that technique 
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allowed us to quantify the copy number of the ER gene in lines of interest, after 
normalisation to the copy number of a control gene, in the same plants (same RNA 
pool). 18S ribosomal RNA gene was used as a control gene after checking its 
expression was not affected by changes in ER expression. 

5 

Results are shown in Figures 4a, 4b and 4c, wherein the y-axis in each figure describes 
the erecta mRNA copy number (normalised to that of 18S mRNA) in wild type ER 
lines, er mutants, and ER transgenics in both Columbia and Landsberg backgrounds. 

10 All ER transgenic lines, except line 145 (Figure 4a) showed increased mRNA copy 
number: from 4 to 170 fold increase compared with the null controls. Interestingly, all 
lines, even those with hugely increased mRNA levels look "normal", healthy and of 
similar size. 

15 Initial phenotypic analysis shows complementation of the "transpiration efficiency 
phenotype". In other words, ER transgenic lines show less negative carbon isotopic 
composition values than null er control and null lines as shown in Table 7. Those 
values converge towards values measured for wild type ER ecotypes. Hence in a 
Columbia background, ER transgenics display values of -30.6 to -31.2 per mil on 

20 average compared to values of -31.7 to -32.2 per mil in the null transgenics (Table 7), 
and -30.9 per mil in the ColO ER wild type (background ecotype for mutant er-105). 
The less negative carbon isotopic compositions in ER transgenics is indicative of 
greater transpiration efficiency in these plants, as expected. 

25 The data presented in Table 7 are confirmed by direct measurement of leaf transpiration 
efficiency (ratio A/E of C0 2 assimilation rate per unit leaf area to transpiration rate) 
using gas exchange techniques. Stomatal density, leaf photosynthetic capacity and 
growth rate are also determined to analyze the underlying causes of the reversion of the 
transpiration efficiency phenotype (leaf development and anatomy, biochemical 

30 properties of leaves, stomatal characteristics). 
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EXAMPLE 9 

Tissue specificity in the expression of the ERECTA gene in wild type rice 
Oryza sativa (cv Nipponbare): 
5 An ortholog of the A. thaliana ERECTA allele (SEQ ID NO: 1) in rice was identified in 
silico by homology searching of the NCBI protein database using the BLAST 
programme under standard conditions. The input sequence was SEQ ID NO: 2. The 
nucleotide sequence of the rice ortholog is presented in SEQ ID NO: 3, with the 
encoded protein comprising the amino acid sequence set forth in SEQ ID NO: 4. 

10 

The mRNA copy number of the rice ERECTA gene was determined for various plant 
organs/parts, as indicated in Figures 5a and 5b. ERECTA mRNA copy numbers were 
determined by quantitative real-time PCR, using 18S mRNA as an internal control gene 
for normalization of data. The pattern of ERECTA expression in rice was similar to the 
15 pattern of gene expression in A. thaliana, with highest expression observed in young 
meristematic tissues, young leaves and even more, the inflorescences. No or very low 
expression is found in roots, as for A thaliana. 

These similarities in tissue specificity between rice and Arabidopsis indicates that the 
20 rice orthologue provided herein as SEQ ID NO: 3 is a true orthologue of the A. thaliana 
ERECTA allele set forth in SEQ ID NO: 1, with similar function 

EXAMPLE 10 

25 Demonstration of a functional role for the rice ERECTA gene 

in modulating transpiration efficiency 

To determine a functional role for the rice ERECTA gene (SEQ ID NO: 3), lines of rice 
plants carrying transposon insertions that affect expression of that gene are analyzed. 

30 
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Nine such mutants were identified in the publicly available collection of transposon 
TOS17 insertional mutants at the Japanese NIAS Institute. The TOS17 retrotransposon 
is described in detail by Hirochika, Current Opinion in Plant Biology, 4, 1 18-122, 2001 
and by Hirochika Plant Mol Biol 35, 231-240, 1997, which is incorporated herein by 
5 reference. The nine mutant lines were identified through the website URL 
http://tos.nias. afire, go . jp/~miyao/pub/tos 1 1L and they have the accession numbers 
NG0578 (mutant A), ND3052 (mutant B), ND4028 (mutant C), NC0661 (mutant D), 
NE1049 (mutant E), NF8517 (mutant F), NE8025 (mutant G), NE3033 (mutant H) and 
NF8002 (mutant I). 

10 

Nine transposon insertional mutants were ordered from NIAS, that carry the TOS17 
stable retrotransposon insert in various parts of the ERECTA gene in the Nipponbare 
background, the genotype used for rice genome sequencing: NG0578, ND3052, 
ND4028, NC0661, NE1049, NF8517, NE8025, NE3033 and NF8002. 

15 

The transposon insertions in these nine lines affect the membrane spanning region of 
the protein (mutants I, D, E) or the Leucine Rich Repeat (LRR) domains (mutant H and 
G) in LRR 7 and LRR 18, respectively. In mutant B, TOS17 alters the coding sequence 
just upstream of sequences encoding the protein kinase domain I. In mutants C and F, 
20 the TOS17 insertion alters the sequence encoding domain Via of the ERECTA protein. 
In mutant A, the TOS7 insertion is in a sequence encoding a region between domains 
IX and X. The sequence information on these mutants is publicly available from the 
NIAS website. 

25 Using mutant seed for lines A-I received from NIAS, plants were grown for 
amplification of seed and analysis. Except in two mutants where several plants died, 
plants look healthy, with good growth indicating that, as in Arabidopsis, there is great 
potential to alter the ERECTA gene towards altered transpiration efficiency without 
adversely affecting growth and/or yield. 

30 
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Based upon sequence information for each mutant A-I, primers were designed to 
amplify the mutant erecta alleles from seedling material derived from 20 seeds. 
Amplification is performed under standard conditions, to identify for each mutant, 
plants that are homozygous, heterozygous or null at the ERECTA locus. Homozygous 
5 TOS17 mutants B and E, and heterozygous lines and null lines in all lines A-I were 
identified. 

In parallel to gaining information on whether or not the mutant lines were homozygous 
or heterozygous or null mutants, specific plant parts are removed for analysis of the 
10 consequences of the mutations on the ERECTA gene expression (levels and tissue 
specificity of expression), using quantitative real-time PCR as described herein. 
Additionally, the transpiration efficiency phenotype of each mutant line is determined 
by measuring C&O isotopic composition and ash contents of plant samples. 

15 Initial results on 13 C isotopic composition of mature blades of rice seedlings reveals 
significant variation between mutant lines (-32.8 to -34.2 per mil) and, in at least 4 
mutants, significant deviations from the wild type values, towards more negative 
values, suggesting that the erecta mutations do affect transpiration efficiency in rice, as 
in Arabidopsis. 

20 

Similar methods as above are applied to anaylzing the progeny of the mutant plants, to 
facilitate analysis of the effects of the erecta mutations under a range of conditions, 
including flooding (as is the most common practice for Nipponbare), water stress such 
as from soil drying (upland rice growth conditions) or low air humidity (heat spells). 
25 Differences in plant morphology, anatomy and apical dominance are noted under each 
environmental condition. Parameters that are characterised include tillering patterns, 
the anatomy of leaves and meristems, development and growth rates. 

Comparisons between mutants A-I are further used to characterize the role of the 
30 different protein domains in conferring different phenotypes observed for each line 
under different environmental and/or agricultural growth conditions. It is interesting 
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that, among the 4 mutants that exhibit much lower C isotopic composition than the wild 
type, three are those mutants where the TOS17 insert affects the membrane spanning 
region. 

5 

EXAMPLE 11 

Effect of silencing ERECTA gene expression on transpiration efficiency 

To confirm the role of the ERECTA gene in conferring the transpiration efficiency 
10 phenotype on a plant, expression of the wild-type ERECTA allele is reduced or 
inhibited using standard procedures in plant molecular biology, such as, for example, 
antisense inhibition of ERECTA expression, or the expression of inhibitory interfering 
RNA (RNAi) that targets ERECTA expression at the RNA level. All such procedures 
will be readily carried out by the skilled artisan using the disclosed nucleotide 
15 sequences of the ERECTA genes provided herein or sequences complementary thereto. 

For transformation of rice and Arabidopsis, transgenes are prepared in disarmed, non- 
tumorigenic binary vectors carrying T-DNA left and right borders and a selectable 
marker operable in E Coli. 

20 

Binary vectors used for DNA transfer include vectors selected from the group 
consisting of: 

1 . pPZP222 (Hajdukiewicz et al, 1994, Plant Mol Biol 25, 989-994); 

2. PBI 121 (Clonetech) (Ueda et al 1999, Protoplasma 206, 201-206); 
25 3. pOCA18 (Olszewski et al 1988, Nucl. AcidRes, 16 10765-10782); 

4. pGreen and pSoup or variants thereof (Hellens et al., 2000, Plant Mol Biol 42, 
819-832) and 

5. binary vectors developed on the pCAMBIA vectors backbone described at the 
webiste of CAMBIA 

30 
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The starting material for all these vectors was the backbone developed by 
Hajdukiewicz et al., 1994. The pPZP series of vectors comprise (i) a wide-host-range 
origin of replication from the Pseudomonas plasmid pVSl, which is stable in the 
absence of selection; (ii) the pBR322 origin of replication (pMB9-type) to allow high- 
5 yielding DNA preparations in E. coli; (iii) T-DNA left (LB) and right (RB) borders, 
including overdrive; and (iv) a CaMV35S promoter expression cassette. While the 
pPZP series of vectors also served as the backbones for the pCAMBIA series, they 
have been very extensively modified for particular applications. 

10 Vectors containing in their T-DNA various combinations of the following components 
are particularly preferred: 

1. hptH resistance gene cassette for conferring resistance to hygromycin on 
transformed plant material, wherein expression of hptn is operably under 
control of Ubil or 35S promoter; 
15 2. a reporter gene cassette comprising nucleic acid encoding the EGFP 
(Enhanced Green Fluorescence Protein) and/or beta-glucuronidase (GUS and 
GUSPlus) reporters; 

3. Gal4/VP16transactivator cassette; and 

4. one or more plant gene expression cassettes comprising either full-length or 
20 partial cDNAs of ERECTA genes in the sense or antisense orientation, or 

capable of expressing RNAi comprising sequences derived from the ERECTA 
gene, including any genomic fragments of plant DNA. 

The binary vectors are transferred to disarmed strain AGL1 of Agrobacterium 
25 tumefaciens by standard tri-parental matings (Ditta et al, 1980, Proc. Natl Acad, Set 
77,7347-7351) using the pRK2013 helper strain of E colt A thaliana plants are 
transformed using the standard floral dip method for transformation by disarmed strains 
of A tumefaciens (Clough and Bent, 1998, The Plant Journal 16, 735-743). Rice is 
transformed by generating embryogenic calli from excised embryos and subjecting the 
30 embryogenic calli to Agrobacterium tumefaciens mediated transformation according to 
published procedures (eg Wang et al 1997, J Gen cmdBreed, 51 325-334, 1997). 



WO 2004/005555 



'CT/AU2003/000854 



82 

Transformed plants are analyzed to confirm that those lines expressing antisense or 
RNAi constructs have reduced expression of functional ERECTA protein and more 
closely resemble the erecta phenotype than do wild-type plants or plants ectopically 
5 expressing a wild-type ERECTA gene in the sense orientation. 

EXAMPLE 12 

Identification of a sorghum ortholog of A thaliana ERECTA 

10 

An ortholog of the A. thaliana ERECTA allele (SEQ ID NO: 1) in sorghum was 
identified in silico by homology searching of the NCBI protein database using the 
BLAST programme under standard conditions. The input sequence was SEQ ID NO: 
2. The nucleotide sequence of the sorghum ortholog is presented in SEQ ID NO: 5, 
15 with the encoded protein comprising the amino acid sequence set forth in SEQ ID NO: 
6. 

EXAMPLE 13 

20 Identification of A. thaliana ERECTA homologs 

Two homologs of the A. thaliana ERECTA allele (SEQ ID NO: 1) were identified in 
silico by homology searching of the NCBI protein database using the BLAST 
programme under standard conditions. The input sequence was SEQ ID NO: 2. The 
25 nucleotide sequences of the A. thaliana ERECTA homologs are presented in SEQ ID 
NOs: 7 and 9, with the encoded proteins comprising the amino acid sequences set forth 
in SEQ ID NOs: 8 and 10, respectively. 

T-DNAinsertional mutants for these two homologous genes, both on chr5, have been 
30 identified in the Salk Institute mutant collection (web address: signal.salk.edu/cg:- 
bin/tdnaexpress). Several of these mutants were ordered: Salk_007643 and 
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Salk_026292 for gene At5g07180; Salk_045045 and SalkJ)81669for gene At5g62230. 
Primer pairs were designed in order to determine insert copy number and 
homo2ygozity/heterozygozity in the seedlings grown from the seeds that were received. 
Homozygous lines with 1 insert were identified and are under characterisation in order 
5 to compare the expression patterns (tissue localisation and mRNA levels) of the two 
genes and of the ERECTA gene across a range of environmental conditions and 
determine whether the three genes are functionally related. 

EXAMPLE 14 

10 Identification of wheat orthologs of A. thaliana ERECTA 

Partial cDNA sequence of orthologs of the A. thaliana ERECTA allele (SEQ ID NO: 1) 
in wheat were initially identified in silico by homology searching of the NCBI protein 
database using the BLAST programme under standard conditions. It was necessary, 
15 however, to conduct additional searches of private databases in order to link the partial 
sequences identified in the NCBI database. Correction of partial sequences located in 
the NCBI database was also necessary in order to generate a contig corresponding to 
the wheat ERECTA ortholog. 

20 The input sequence is the A. thaliana (SEQ ID NO: 2) or rice (SEQ ID NO: 4) amino 
acid sequences or a nucleotide sequence encoding same. The nucleotide sequences of 
the wheat ortholog are presented in SEQ ID NOs: 11-19, with the encoded proteins 
comprising the amino acid sequences set forth in SEQ ID NO: 20. 

25 The sequence set forth in SEQ ID NOs: 11 to 18 are partial cDNA sequences. The 
corresponding sequence of the wheat ERECTA ortholog (SEQ ID NO: 19) is isolated 
by standard nucleic acid hybridization screening of a wheat cDNA library. 

To confirm the role of the wheat ERECTA orthologs in transpiration efficiency, 
30 expression data sets are used for in silico studies of ERECTA gene expression in a 
range of tissues of wheat plants grown under a range of environmental conditions, 
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thereby providing indications of tissue specificities in expression patterns and 
preliminary data on the types of environments where the ERECTA ortholog is most 
likely to play a physiological role in relation to water use in this species. In these 
studies, nucleic acid comprising the sequence set forth in SEQ ID NO: 11 to 19, or a 
5 sequence complementary thereto, are used to produce hybridization probes and/or 
amplification primers. 

Additionally, an ERECTA gene (SEQ ID NO: 11 to 19) in the sense or antisense 
orientation is introduced into wheat, thereby producing transformed expression lines. 
10 Gene constructs are specifically to silence ERECTA gene expression using RNAi 
technology, or alternatively, to ectopically express the entire open reading frame of the 
gene. 

Based upon similar function, the open reading frame of the A. thaliana ERECTA gene 
15 (i.e., SEQ ID NO: 1) is also introduced into wheat plant material in the sense 
orientation, thereby ectopically expressing A. thaliana ERECTA in wheat. 

Gene constructs are introduced into wheat following any one of a number of standard 
procedures, such as, for example, using A. twnefaciens mediated transformation as 

20 described in published AU 738153 or EP 856,060-A1 or CA 2,230,216 to Monsanto 
Company, or using published biolistic transformation methods as described by 
Pellegrineschi et al, Genome 45(2), 421-30, 2002. Accordingly, genetic 
transformation is readily used to generate wheat lines with altered expression of an 
ERECTA gene. About 30 to 40 different transformants are produced, depending upon 

25 the efficiency of RNAi in reducing expression of ERECTA in wheat. 

Primary transformants (TO) are characterized to determine the number and loci at 
which transgenes are inserted. Tl and T2 segregating progenies are then generated 
from selected TO transformants, and analyzed to determine segregation ratio and to 
30 confirm the number of loci having inserted transgenes. Those Tl and/or T2 lines 
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having single transgene insertions are selected and used to generate and multiply seed 
for physiological studies. 

Water use efficiency in the Tl and/or T2 lines is determined through (a) gravimetric 
5 measurements of water transpired and biomass increases; (b) 13 C isotopic 
discrimination in plant tissues, (i.e., by determining A; and (c) ash content of plant 
tissue. 

Meristem and leaf development are also analyzed, especially with respect to the 
10 differentiation and anatomy of the epidermis, the stomatal complexes and the 

mesophyll tissue and by examining leaf gas exchange properties. This is done using 

microscopy, in situ imaging techniques and concurrent on-line measurements of C 

isotopic discrimination (A) and of CO2 and water fluxes in and out of leaves. 

Information on gene regulation and the network of genes in which the ERECTA 
15 ortholog operates in its effects on transpiration efficiency, is determined by 

transcriptome analysis of a restricted set of the transgenic lines with altered ERECTA 

expression. 

As described herein for A, thaliana and rice, correlations between physiological 
20 measurements and gene expression level or copy number confirm the role of the 
ortholog in conferring the transpiration efficiency phenotype in wheat. 



Partial cDNA sequence of ortholog of the A, thaliana ERECTA allele (SEQ ID NO: 1) 
in maize were initially identified in silico by homology searching of the NCBI protein 
database using the BLAST programme under standard conditions. It was necessary, 
30 however, to conduct additional searches of private databases in order to link the partial 
sequences identified in the NCBI database. Correction of partial sequences located in 



EXAMPLE 15 



25 



Identification of a maize ortholog of A, thaliana ERECTA 
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the NCBI database was also necessary in order to generate a contig corresponding to 
the maize ERECTA ortholog. 

The input sequence was SEQ ID NO: 2. The nucleotide sequence of a maize ortholog 
5 is presented in SEQ ID NOs: 21 to 44, with the encoded protein comprising the amino 
acid sequence set forth in SEQ ID NO: 45. 

The sequence set forth in SEQ ID NOs: 21 to 43 are partial cDNA sequences. The 
corresponding sequence of the maize ortholog (SEQ ID NO: 44) is isolated by standard 
10 nucleic acid hybridization screening of a wheat cDNA library. 

To confirm the role of the maize ERECTA ortholog in transpiration efficiency, 
expression data sets are used for in silico studies of ERECTA gene expression in a 
range of tissues of maize plants grown under a range of environmental conditions, 

15 thereby providing indications of tissue specificities in expression patterns and 
preliminary data on the types of environments where the ERECTA ortholog is most 
likely to play a physiological role in relation to water use in this species. In these 
studies, nucleic acid comprising the sequence set forth in SEQ ID NO: 15, or a 
sequence complementary thereto, is used to produce hybridization probes and/or 

20 amplification primers. 

Additionally, collections of transposon-tagged maize mutants are searched to select 
those having insertions that affect expression of the ERECTA gene and the expression 
level and/or copy number of the ERECTA ortholog is correlated to transpiration 
25 efficiency under the range of environmental growth conditions, essentially as described 
herein for A thaliana and rice. 

Additionally, an ERECTA gene in the sense or antisense orientation is introduced into 
maize, thereby producing transformed expression lines. Gene constructs are specifically 
30 to silence ERECTA gene expression using RNAi technology, or alternatively, to 
ectopically express the entire open reading frame of the gene. 
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Based upon similar function, the open reading frame of the A. thaliana ERECTA gene 
(i.e., SEQ ID NO: 1) is also introduced into maize plant material in the sense 
orientation, thereby ectopically expressing A thaliana ERECTA in maize. 

5 

Gene constructs are introduced into maize following any one of a number of standard 
procedures, such as, for example, any of the methods described by Gordon-Kamm et 
al., Plant Cell 2(7), 603-618, 1990; US Patent No. 5,177,010 to University of Toledo; 
US Patent No. 5,981,840 to Pioneer Hi-Bred; or published US application No. 
10 20020002711 Al (Goldman and Graves);. Accordingly, genetic transformation is used 
to generate maize lines with altered expression of an ERECTA gene. 

About 30 to 40 different transformants are produced, depending upon the efficiency of 
RNAi in reducing expression of ERECTA. 

15 

Primary transformants (TO) are characterized to determine the number and loci at 
which transgenes are inserted. Tl and T2 segregating progenies are then generated 
from selected TO transformants, and analyzed to determine segregation ratio and to 
confirm of number of loci having inserted transgenes. Those Tl and/or T2 lines having 
20 single transgene insertions are selected and used to generate and multiply seed for 
physiological studies. 

Water use efficiency in the Tl and/or T2 lines is determined through (a) gravimetric 
measurements of water transpired and biomass increases; (b) 13 C isotopic 
25 discrimination in plant tissues, (i.e., by determining A); and (c) ash content of plant 
tissue. 

Meristem and leaf development are also analyzed, especially with respect to the 
differentiation and anatomy of the epidermis, the stomatal complexes and the 
30 mesophyll tissue and by examining leaf gas exchange properties. This is done using 
microscopy, in situ imaging techniques and concurrent on-line measurements of A and 
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of CO2 and water fluxes in and out of leaves. Information on gene regulation and the 
network of genes in which the ERECTA ortholog operates in its effects on transpiration 
efficiency, is determined by transcriptome analysis of a restricted set of the transgenic 
lines with altered ERECTA expression. 

5 

As described herein for A. thaliana and rice, correlations between physiological 
measurements and gene expression level or copy number confirm the role of the 
ortholog in conferring the transpiration efficiency phenotype in maize. 

10 

EXAMPLE 16 

Mechanism of enhanced transpiration efficiency and inheritance of ERECTA 
in Arabidopsis (Landsberg and Columbia backgrounds) 

15 The present inventors performed direct measurements of transpiration efficiency (ratio 
of CO2 assimilation rate to transpiration rate) in both Landsberg and Columbia 
backgrounds. To confirm the role of ERECTA under a wider range of agronomically 
relevant conditions, the transpiration efficiencies of transformed plants carrying an 
ERECTA allele in response to varying environmental conditions (i.e., soil water and ion 

20 content, atmospheric humidity and CO2 levels) were determined and compared to the 
response of wild type plants (e.g., ER). Results of these experiments are presented in 
Figures 6-11, and Tables 8 and 9. 

Data in Figures 6 show that the enhanced transpiration efficiency obtained by inserting 
25 a transgene carrying the wild type ER allele in the Ld-erl mutant (line T2+ER) is 
mostly due to a decreased stomatal conductance. The phenotype of the transgenic line 
(T2+ER in graphs) is similar to that of a Ld-ER ecotype near isogenic to Ld-erl 
obtained from the Stock Centre (line 3177 on graphs). The increased transpiration 
efficiency in transgenic ER, compared to levels observed in wild type ER line is 
30 observed under both current ambient CO2 levels and increased CO2 levels that are 
within the limits predicted to occur worldwide over the next two decades. 
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Data in Figure 8 show that the reduced stomatal conductance in the ER T2 transgenic 
line compared to the Ld-erl line is, at least for a large part, caused by a reduced 
stomatal density (decrease in the number of stomata per unit area by more than hal£ 
5 down to similar levels as those observed in wild type Ld-ER). This decrease in stomatal 
density is relatively higher than that in the density of epidermal cells whose surface 
area is increased by only about 10%. It therefore follows that the ER transgene has 
affected stomatal development, specifically, and caused a decreased in stomatal index. 
These data show complementation with respect to the processes driving variation in 
10 transpiration efficiency. 

Reciprocal crosses were also performed between the two parental lines NW20 (Ld-erl) 
and Col4 (Col-ER). The notation Fl (Col*Ld) refers to the Fl plants where Col was the 
recipient of Ld pollen, while the notation Fl (Ld*Col) indicates the converse (Ld ovary 

15 receiving Col pollen). Initial analysis of these two types of Fl plants has been made 
for: gas exchange and photosynthetic properties, transpiration efficiency (Figure 7) and 
C isotopic composition (Table 9), rosette shape and developmental rate, anatomy of 
leaf epidermis (Figure 8), flowering date, inflorescence and pod shape. Consistent with 
our analysis of complementation experiments, the data show that the ERECTA gene 

20 affects all these phenotypes and not only inflorescence and pod shape. 

The data also show a complex inheritance of the ERECTA gene, such that the gene is 
dominant, with no reciprocal effect on pod shape (longer pods, longer stems and 
pedicels in all Fi plants, similar to the Col -ER parent). However, for other traits, 
25 results indicate maternal effects: hence the transpiration efficiency values (see Figure 
7a) and rosettes carbon isotope composition in Fi plants (Table 9) are intermediate 
between the parental values, but different between the two sets of Fl plants: values for 
Fi plants (Col*Ld) are closer to the Col values, while those for Fl plants (Ld*Col) are 
closer to values for the Ld parent. 

30 
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Data in Figure 8 indicate that stomatal conductance (transpiration per unit leaf area, 
Figure 8a) displays values close to the Ld-erl parent in all Fl plants, despite the 
stomatal densities being close to the Col-ER parent (Figure 8c). This shows that the ER 
gene affects not only epidermis development but also stomatal aperture (dynamics of 
5 stomata) and that while the ER effect on stomatal density appears to be dominant, 
effect on stomatal aperture is not. 

Data in Figure 9 show the effect of various er mutations (in Col background, mutants 
obtained from the Stock Centre or Dr Torii) on the number of stomata per unit leaf 
10 area. The stomatal densities for all but two of those mutants are greater than those the 
ColER wild type leaves, and confirm the effect of the ERECTA gene on that parameter. 

Data in Figure 10 show that enhanced transpiration efficiency in the ER transgenic line 
compared with null Ld-erl (no insertion of transgene) is confirmed by the less negative 

15 C isotopic composition values measured in leaf material (compare values for lines 
NW20 and CS20 (Ld erl; lines 16 and 17 on x-axis) and a transgenic T2 ld-ER line, 
homozygous for he ER transgene (line 19 in the Figure). The C isotopic values 
measured in the ER transgenic line are similar to those in the near isogenic Ld-ER 
ecotype (line 18 in Figure 10). This demonstrates complementation on this phenotypic 

20 trait, and validates once again the use of C isotopic composition as a quantitative 
indicator (substitute) of transpiration efficiency. 

Data in Figure 10 also the C isotopic compositions of a range of Col-er mutants, 
including those analysed in Figure 9 for stomatal densities. Most mutants show more 
25 negative C isotopic values than the COL-ER ecotype. This is consistent with the 
increased stoamtal densities described in Figure 9 and with all other comparisons of C 
isotopic compositions or direct measurements of transpiration efficiencies in er/ER 
lines and again indicative of the positive effect of the ER allele on transpiration 
efficiency. 

30 
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A few mutants in Figure 10 stand out, eg Col-erl05, or line 3140 (a line from NASC 
carrying the erl and gll-1 mutations). As genetic information is available for these 
mutants (nature and position of mutations) these mutants provide very useful functional 
information on the protein domain(s) of the ERECTA protein that are essential for 
5 conferring the transpiration efficiency phenotype and underlying processes. 

The present inventors also perform direct measurements of transpiration efficiency 
(ratio of CO2 assimilation rate to transpiration rate) in several T2 transformants 
generated in a Columbia background (i.e. transformation of mutant er-105 and er-2/106 

10 above). Results from these measurements are shown in Figure 11. These data show 
that the phenotype can be complemented in a Columbia background, as determined by 
measuring transpiration efficiency, transpiration and CO2 assimilation rates. 
Complementation is observed under conditions of both high humidity and low 
humidity, hence the demonstration that the ERECTA gene plays a role in the control of 

15 transpiration efficiency under both well watered and drought conditions, and that 
overexpression of that gene has the potential of increasing growth and resistance to 
drought and drought related stresses. 

More particularly, the data in Figure 11 demonstrate the role of the ERECTA gene on 
20 transpiration efficiency across a range of humidities, including low humidities such as 
prevail in warm and dry areas: 

- the er-105 mutant which carries a knock-out mutation of ERECTA (quasi no ER 
transcript) (open black squares) in ColO background has lower transpiration 
efficiency than the wild type near isogenic ColO (open triangles). 
25 - this mutant was transformed with an ER transgene under the 35S promoter and 
several ER homozygous T2 lines were produced (solid circles). Those lines (5 
independent transformants are included in the graph) have much increased 
transpiration efficiencies (+40 to 70%) compared to the null lines (solid 
squares) and similar to those measured for the wild type ER-ColO line, across 
30 the whole range of leaf to air vapour pressure deficit tested In our experiments. 
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Additionally, null lines that carry no transgene insertion but went through 
transformation and selection on antibiotics display similar values as the starting er-105 
mutant demonstrating that these manipulations themselves have no detectable 
confounding effect on transpiration efficiency. 
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TABLE 2 



Background 


Mutation 


Stock Centre name 


Isogenic ER line and 
Stock Centre Name 


Landsberg 


erl 


CS20orNW20 a 


3177 or CS163 










Columbia 


er2 b /erl06 


3401 


Coll or 3 176 










Columbia 


er!05 c 




Col3 with gll marker 
or ColO 











a, NW20 is an Lev parent for Lister and Dean's recombinant lines, carrying the erl 



5 mutation. Lines 3 177 or CS163 are the closest isogenic ER lines. 

b, er2 is an er allele identified by Redei in Col background. Coll or 3176 are the 
closest Col near-isogenic lines. The er2 is same mutation as mutation er-106 later 
reported by Torii and collaborators (Lease et al. 2001) 

c, erl 05 was isolated from a fast-neutron-irradiated Col seed population (Torii et al y 
10 1996). 

d, Col4, the Col parent for the Lister and Dean's parent was systamically included in all 
comparisons. 
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TABLE 3 

Comparison of er/ER lines in both Col and Ld background for carbon isotope discrimination values 
(per mil) in leaf material under a range of environmental conditions 


DifTerences in mean carbon isotope discrimination values (per mil) 


(9) 
erl-Coli 










0.73 


0.73 


0.15 


0.17 


0.28 


0.36 


o 

^ U 

w O 

T-H 
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0.16 


1.18 


i— i 
o 
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0.95 


1.22 


VO 
I— 1 


0.88 


0.95 


(7) 
erl-Col4 

(parental 
lines ior 
RILs) 










0.82 


0.73 


0.05 


0.06 


0.56 


0.33 


i> 
r- 










0.64 


0.74 


0.35 


0.28 


0.00 


0.39 


vo 
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0.38- 


VO 
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>-s 

3^ lA 
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TABLE 6 



Summary table of the lines used for initial functional characterisation 
and analysis of ER effects: 



Background 
ecotype 


Stable T2 homozygous ER transfbrmants 


Null er control 


Col-erl05 


T8; T29; T19; T61 


T18 


Col-erl06/er2/3401 


T165; T169; T279; T290 


T143 


Ld-erl 


T3-7K 


NW20 
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TABLE 7 



Carbon isotope composition (per mil) of 3 mature leaves, ground together 
harvested 4/6/03 ie 32 days after sowing from still vegetative rosettes 





T2 ER homozygous 
transformants 


Null 

transgenics 
(er) 


background 
er 

mutant 


Line: 


T46 T29 

-31.4 -31.2 


T18 

-32.2 


Col-erl05 


Line: 

Average: 
se: 


T145 T165 T279 T290 

-30.4 -31 -30.5 -30.8 

-30.6 
0.15 


T154 T143 T247 

-31.5 -32 -31.7 

-31.7 
0.12 


Col erl06/er2 
/3401 

-31.7 


line 


T3-7K 

-30.4 




Ld-erl 
NW20 

-31.3 
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TABLE 8 



Erecta 


Line 


C isotopic composition 








alleleLine 


name 


(per mil) 














Run 14 




Run 18 










Average 


St Err 


Average 




St Err 


Col_er 














mutants 


102K 








-29.4 


.011 




103K 








-29.0 


.10 




105C 


-31.4 


0.12 




-30.3 


.09 




105KH 


-30.2 






-29.3 


0.11 




105KS 


-29.8 


0.07 




-29.5 


.07 




3401 


-30 


0.21 




-29.4 


.05 




106C 


-30.2 


0.04 




-29.4 


.12 




108K 


-30.2 


0.07 




-29.5 


.06 




111KH 


-30.4 


0 




-29.5 


.11 




11 IKS 


-30.2 


0.11 




-29.7 


.12 




114K 


-30.2 


0.11 




-29.5 


.04 




116K 


-29.7 


0.21 




-29.0 


.13 




117K 


-29.7 


0.18 




-29.3 


.08 




3140 








-32.2 


.06 


ColO_ER 


1093 


-29.6 


0 


-29.0 


.10 


Ld_erl 


NW20 


-30.0 


0.25 




-28.9 


.15 




CS20 


-29.9 


0.08 








Ld_ER 


3177 


-29.4 






-28.2 


.09 


Transgenic 














Ld_erl 














+wild type 














ER 


3-7K 


-29.5 


0.07 




-28.4 


.10 
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TABLE 9 

5 





C isotope composition (per mil) 


ColJER (line 933) 


-28.1 


Fl (Col*Ld) 


-28.5 


Fl (Ld*COL) 


-29.3 


Ld-erl (lineNW20) 


-29.9 
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WE CLAIM: 

1. A method of selecting a plant having enhanced transpiration efficiency, 
comprising detecting a genetic marker for transpiration efficiency which marker 
comprises a nucleotide sequence linked genetically to an ERECTA locus in the genome 

5 of the plant and selecting a plant that comprises or expresses the genetic marker. 

2. The method according to claim 1 wherein the genetic marker comprises an 
ERECTA allele or erecta allele, or a protein-encoding portion thereof 

10 3. The method according to claim 2 wherein the genetic marker comprises a 
nucleotide sequence having at least about 55% overall sequence identity to at least 
about 20 nucleotides in length of any one of SEQ ID Nos: 1, 3, 5, 7, 9, 11 to 19 or 21 to 
44 or a complementary sequence thereto. 

15 4. The method according to claim 2 or 3 wherein the genetic marker comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a sequence having at least about 55% identity to a sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 

20 ID NO: 14, SEQ DO NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 

SEQ ID NO: 19 SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 
NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 

25 SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID 

NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; 

(b) a sequence encoding an amino acid sequence having at least about 55% identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 

30 12, SEQ ID NO: 20 and SEQ ID NO: 45; and 

(c) a sequence complementary to (a) or (b). 
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5. The method according to any one of claims 1 to 4 wherein the plant is selected 
from the group consisting of Arabidopsis thalicma, rice, sorghum, wheat and maize. 

5 6. The method according to any one of claims 1 to 5 comprising linking the 
transpiration efficiency phenotype of the plant to the expression of the marker in the 
plant. 

7. The method according to any one of claims 1 to 5 comprising linking a 
10 structural polymorphism in DNA to a transpiration efficiency phenotype in the plant. 

8. The method according to claim 7 wherein the polymorphism is determined by a 
process comprising detecting a restriction fragment length polymorphism (RFLP), 
amplified fragment length polymorphism (AFLP), single strand chain polymorphism 

15 (SSCP) or microsatellite analysis. 

9. The method according to any one of claims 1 to 8 comprising hybridizing a 
probe or primer of at least about 20 nucleotides in length from any one of SEQ ID Nos: 
1, 3, 5, 7, 9, 11 to 19 or 21 to 44 or a complementary sequence thereto to genomic DNA 

20 from the plant, and detecting the hybridization using a detection means. 

10. The method according to any one of claims 1 to 9 wherein the selected plant has 
enhanced transpiration efficiency compared to a near-isogenic plant that does not 
comprise or express the genetic marker. 

25 

11. A method of selecting a plant having enhanced transpiration efficiency, 
comprising: 

(a) screening mutant or near-isogenic or recombinant inbred lines of plants to segregate 
alleles at mERECTA locus; 
30 (b) identifying a polymorphic marker linked to said ERECTA locus; and 
(c) selecting a plant that comprises or expresses the marker. 
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12. A method of modulating the transpiration efficiency of a plant comprising 
introducing an isolated ERECTA gene or an allelic variant thereof or the protein- 
encoding region thereof to a plant and selecting a plant having a different transpiration 

5 efficiency compared to a near-isogenic plant that does not comprise the introduced 
ERECTA gene or allelic variant or protein-encoding region. 

13. The method according to claim 12 wherein the ERECTA gene or allelic variant 
or protein-encoding region comprises a nucleotide sequence selected from the group 

10 consisting of: 

(c) a sequence having at least about 55% identity to a sequence selected from the 



(d) a sequence encoding an amino acid sequence having at least about 55% identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 20 and SEQ ID NO: 45. 



14. The method according to claim 12 or 13 wherein the plant is selected from the 
group consisting of Arabidopsis thaliana, rice, sorghum, wheat and maize. 



15 



20 



group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 
ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 
SEQ ID NO: 19; SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 
NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 
SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID 
NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 



25 



15. The method according to any one of claims 12 to 14 wherein the ERECTA gene 
30 or allelic variant or protein-encoding region is introduced to the plant by a process 
comprising introgression. 
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16. The method according to any one of claims 12 to 14 wherein the ERECTA gene 
or allelic variant or protein-encoding region is introduced to the plant by a process 
comprising transforming plant material with a gene construct comprising the gene or 

5 allelic variant or protein-encoding region thereof. 

17. The method according to any one of claims 12 to 16 further comprising 
expressing the introduced gene or allelic variant or protein encoding region in the plant. 

10 18. The method according to any one of claims 12 to 17 wherein transpiration 
efficiency is enhanced in the plant. 

19. The method of claim 18 wherein the transpiration efficiency is enhanced as a 
consequence of the ectopic expression of an ERECTA allele or the protein-encoding 

15 region thereof in the plant. 

20. The method according to any one of claims 12 to 17 wherein transpiration 
efficiency is reduced in the plant. 

20 21. The method of claim 20 wherein the transpiration efficiency is reduced as a 
consequence of reduced expression of an ERECTA allele in the plant. 

22. A plant having modified transpiration efficiency compared to a near-isogenic 
plant wherein said plant is produced by a process comprising performing the method 

25 according to any one of claims 12 to 21 . 

23. The plant of claim 22 selected from the group consisting of a rice plant, a wheat 
plant and a maize plant. 

30 24. An isolated ERECTA gene from wheat comprising a nucleotide sequence 
selected from the group consisting of: 
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(i) a sequence selected from the group consisting of: SEQ ID NO: 1 1, SEQ ID NO: 
12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 
17, SEQ ID NO: 18, and SEQ ID NO: 19; 

(ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 20; and 
5 (iii) a sequence that is complementary to (i) or (ii). 

25. An isolated ERECT A gene from maize comprising a nucleotide sequence 
selected from the group consisting of: 

(i) a sequence selected from the group consisting of: SEQ ID NO: 21, SEQ ID NO: 
10 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 
27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 
32 SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 
37, SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID NO: 
42, SEQ ID NO: 43 and SEQ ID NO: 44; 
15 (ii) a sequence encoding the amino acid sequence set forth in SEQ ID NO: 45; and 

(iii) a sequence that is complementary to (i) or (ii). 

26. A gene construct comprising the isolated ERECTA gene according to claim 24 
or 25 operably in connection with a promoter sequence that is operable in a plant. 

20 

27. Use of an isolated ERECTA gene or allelic variant or protein-encoding region 
thereof in the preparation of a genetic construct for modulating the transpiration 
efficiency of a plant. 

25 28. Use according to claim 27 wherein the ERECTA gene or allelic variant or 
protein-encoding region comprises a nucleotide sequence selected from the group 
consisting of: 

(a) a sequence having at least about 55% identity to a sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
30 NO: 7, SEQ ID NO: 9, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 

ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 
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SEQ ID NO: 19; SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 
NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 

5 SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID 

NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 
(b) a sequence encoding an amino acid sequence having at least about 55% identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 

10 12, SEQ ID NO: 20 and SEQ ID NO: 45. 

29. A method of identifying a gene that determines the transpiration efficiency of a 
plant comprising: 

(a) identifying a locus associated with genetic variation in transpiration efficiency 
15 in the plant; 

(b) identifying a gene or allele that is linked to said locus, wherein said gene or 
allele is a candidate gene or allele for determining the transpiration efficiency of 
a plant; and 

(c) determining the transpiration efficiencies of a panel of plants, wherein not all 
20 members of said panel comprise or express said gene or allele, and wherein 

variation in transpiration efficiency between the members of said panel 
indicates that said gene is involved in determining transpiration efficiency. 

30. A method of identifying a gene that determines the transpiration efficiency of a 
25 plant comprising: 

(a) identifying a locus associated with genetic variation in transpiration efficiency 
in a plant; 

(b) identifying multiple alleles of a gene that is linked to said locus, wherein said 
gene is a candidate gene involved for determining the transpiration efficiency of 

30 a plant; and 
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(d) determining the transpiration efficiencies of a panel of plants, wherein each 
member of said panel comprises at least one of said multiple alleles, wherein 
variation in transpiration efficiency between the members of said panel indicates 
that said gene is involved in determining transpiration efficiency. 

5 

31. A method of increasing the resistance of a plant to an environmental stress 
comprising enhancing the level of expression of an ERECTA gene or allelic 
variant thereof or protein encoding region thereof in said plant. 



10 32. A method of increasing seed or grain weight in a plant comprising enhancing the 
level of expression of an ERECTA gene or allelic variant thereof or protein 
encoding region thereof in said plant. 



33. A method of increasing the number of seeds produced by a plant comprising 
15 enhancing the level of expression of an ERECTA gene or allelic variant thereof 

or protein encoding region thereof in said plant. 



34. The method of any one of claims 31 to 33, the level of expression is enhanced 
by introducing an ERECTA gene or allelic variant thereof or the protein 
20 encoding region thereof to a plant. 



The method of claim 34 wherein the ERECTA gene or allelic variant or protein- 
encoding region comprises a nucleotide sequence selected from the group consisting of: 
(a) a sequence having at least about 55% identity to a sequence selected from the 
25 group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 

NO: 7, SEQ ID NO: 9, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 
ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 
SEQ ID NO. 19 SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, 
30 SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 SEQ ID 

NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, 
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SEQ ID NO: 38; SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID 

NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44; and 
(b) a sequence encoding an amino acid sequence having at least about 55% identity 

to an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 
5 SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 

12, SEQ ID NO: 20 and SEQ ID NO: 45. 
35. A plant produced by the method of any one of claims 3 1 to 34, 
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Figure 7a: Transpiration efficiency (mmol C mol H 2 0" 1 ) at 350 
and 500ppm C0 2 
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Figure 8a Stomatal conductance (mol H 2 0 m" 2 s" 1 ) 
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Figure 9a Number of stomata per mm 2 
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ALGAAQGlAYLHHICSPTUIHRDVKSKraiAXK^^ 

A YG AAQG 1A YLHHDC 3 P RI IHRDVKSSNI LLDKDLEARLTDFG IAK3 LCV S KSHT 3 TYVM 
AVGAAQG LAYLHHDCNPRI IHRDVKSSNI LLDSNFEAHLSDFGIA>3CVPSAKSHAS TYV L 
AVGAAQGLAYLHKDCNP11XVKRDVKSSNX liiDEHFEAHLSDFG IAKCV P AAKTHASTYV L 
• A************.***,******.*****!, **,*,*»****., t *|*.****, 

GTIGYIDPEYARTSRLNEKSDVYR LWHCSAG — AAEW0EASGQRIL3KTASNEVMD 



GTXGYXDPEYART SRLT EKSDVY SYGX VL t2 LLTiWKAVDDESinJlHLXMSKTGKNEVHS 
GTIGYroPEYARTSRmEKSDVYSFGrVLl^LLTGKKAVDNESNl^LIL^ 
GTIGYIDPEYARTSQLNEKSDVYS 



B recta_oorghuo 
Erecta^oalze 
Erecta^Arabidopcis 
Brecta_rice 
Erecta wheat 



TVS Pt» IGDTCKDIJSEVKKLFQ IALLCT 13tQPSt) RPTMHEVVRV XX>CLVNPD PP PKP 

TVDPBVGDTCKIJIxnWKKlTCLALLCTiaiQPS^ 

MADraiTSTCKDLGVVKXVrFQLALLCTKR^Pl^ 

AVDSEV3VTCTCKGLVRia^lALl^KRHP8DRP^^ 



Erocta_so rghua 

Eroctajoaise 

Erecta_Arabidop3ia 

Erecta_rice 

Erecta_wheat 



SAf^LPQPS PAVP 3TH IEYVSLRGTGA LSCWJ3T3 TSSAELFLXFGEAX SQKMS 

QKAHAHH0LP^PQP3PPAYVDSTVSUUjTGAL3CAII33ST3DABLF1JT 

DT3-AT lAG3CYVI>EYAm>KTPK3Vl^-SKSA3TVVQLFU^GQVISQNSE 

KTV-DYSRL1AS TT TAADMRGHDV TDIG-DN3SSDEQW7VRFGSVISKHTM 
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Ere eta sorghum 



Erecta maise 



Erecta krabidopsis 



Eracta rice 



Erecta wheat 
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SEQUENCE LISTING 

<110> The Australian National University 

5 <120> METHOD OF PRODUCING PLANTS HAVING ENHANCED TRANSPIRATION EFFICIENCY AND 
PLANTS PRODUCED THEREFROM 

<130> 94948/MRO 

10 <150> AU PS3339 
<151> 2002-07-02 

<160> 45 

15 <170> Patentln version 3.1 

<210> 1 
<211> 3176 
<212> DNA 

20 <213> Arabidopsis thaliana ERECTA allele 
<400> 1 

gtttcttctt catggagact. tgaaagcttt taaagtatat ctaaaaacgc agtcgtttta 60 
25 agactgtgtg tgagaaatgg ctctgtttag agatattgtt cttcttgggt ttctcttctg 120 
cttgagctta gtagctactg tgacttcaga ggagggagca acgttgctgg agattaagaa 180 
gtcattcaaa gatgtgaaca atgttcttta tgactggaca acttcacctt cttcggatta 240 

30 

ttgtgtctgg agaggtgtgt cttgtgaaaa tgtcaccttc aatgttgttg ctcttaattt 300 
gtcagatttg aatcttgatg gagaaatctc acctgctatt ggagatctca agagtctctt 360 
35 gtcaattgat ctgcgaggta atcgcttgtc tggacaaatc cctgatgaga ttggtgactg 420 
ttcttctttg caaaacttag acttatcctt caatgaatta agtggtgaca taccgttttc 480 
gatttcgaag ttgaagcaac ttgagcagct gattctgaag aataaccaat tgataggacc 540 

40 

gatcccttca acactttcac agattccaaa cctgaaaatt ctggacttgg cacagaataa 600 
actcagtggt gagataccaa gacttattta ctggaatgaa gttcttcagt atcttgggtt 660 
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gcgaggaaac aacttagtcg gtaacatttc tccagatttg tgtcaactga ctggtctttg 720 

gtattttgac gtaagaaaca acagtttgac tggtagtata cctgagacga taggaaattg 780 

5 

cactgccttc caggttttgg acttgtccta caatcagcta actggtgaga tcccttttga 840 

catcggcttc ctgcaagttg caacattatc attgcaaggc aatcaactct ctgggaagat 900 

10 tccatcagtg attggtctca tgcaagccct tgcagtctta gatctaagtg gcaacttgtt 960 

gagtggatct attcctccga ttctcggaaa tcttactttc accgagaaat tgtatttgca 1020 

cagtaacaag ctgactggtt caattccacc tgagcttgga aacatgtcaa aactccatta 1080 

15 

cctggaactc aatgataatc atctcacggg tcatatacca ccagagcttg ggaagcttac 1140 

tgacttgttt gatctgaatg tggccaacaa tgatctggaa ggacctatac ctgatcatct 1200 

20 gagctcttgc acaaatctaa acagcttaaa tgttcatggg aacaagttta gtggcactat 12 60 

accccgagca tttcaaaagc tagaaagtat gacttacctt aatctgtcca gcaacaatat 1320 

caaaggtcca atcccggttg agctatctcg tatcggtaac ttagatacat tggatctttc 1380 

25 

caacaacaag ataaatggaa tcattccttc ttcccttggt gatttggagc atcttctcaa 1440 

gatgaacttg agtagaaatc atataactgg tgtagttcca ggcgactttg gaaatctaag 1500 

30 aagcatcatg gaaatagatc tttcaaataa tgatatctct ggcccaattc cagaagagct 1560 

taaccaatta cagaacataa ttttgctgag actggaaaat aataacctga ctggtaatgt 1620 

tggttcatta gccaactgtc tcagtctcac tgtattgaat gtatctcata acaacctcgt 1680 

35 

aggtgatatc cctaagaaca ataacttctc aagattttca ccagacagct tcattggcaa 1740 

tcctggtctt tgcggtagtt ggctaaactc accgtgtcat gattctcgtc gaactgtacg 1800 

40 agtgtcaatc tctagagcag ctattcttgg aatagctatt gggggacttg tgatccttct 1860 

catggtctta atagcagctt gccgaccgca taatcctcct ccttttcttg atggatcact 1920 
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tgacaaacca gtaacttatt cgacaccgaa gctcgtcatc cttcatatga acatggcact 1980 

ccacgtttac gaggatatca tgagaatgac agagaatcta agtgagaagt atatcattgg 2040 

5 gcacggagca tcaagcactg tatacaaatg tgttttgaag aattgtaaac cggttgcgat 2100 

taagcggctt tactctcaca acccacagtc aatgaaacag tttgaaacag aactcgagat 2160 

gctaagtagc atcaagcaca gaaatcttgt gagcctacaa gcttattccc tctctcactt 2220 

10 

ggggagtctt ctgttctatg actatttgga aaatggtagc ctctgggatc ttcttcatgg 2280 

ccctacgaag aaaaagactc ttgattggga cacacggctt aagatagcat atggtgcagc 2340 

15 acaaggttta gcttatctac accatgactg tagtccaagg atcattcaca gagacgtgaa 2400 

gtcgtccaac attctcttgg acaaagactt agaggctcgt ttgacagatt ttggaatagc 24 60 

gaaaagcttg tgtgtgtcaa agtcacatac ttcaacttac gtgatgggca cgataggtta 2520 

20 

catagacccc gagtatgctc gcacttcacg gctcactgag aaatccgatg tctacagtta 2580 

tggaatagtc cttcttgagt tgttaacccg aaggaaagcc gttgatgacg aatccaatct 2640 

25 ccaccatctg ataatgtcaa agacggggaa caatgaagtg atggaaatgg cagatccaga 2700 

catcacatcg acgtgtaaag atctcggtgt ggtgaagaaa gttttccaac tggcactcct 2760 

atgcaccaaa agacagccga atgatcgacc cacaatgcac caggtgactc gtgttctcgg 2820 

30 

cagttttatg ctatcggaac aaccacctgc tgcgactgac acgtcagcga cgctggctgg 2880 

ttcgtgctac gtcgatgagt atgcaaatct caagactcct cattctgtca attgctcttc 2940 

35 catgagtgct tctgatgctc aactgtttct tcggtttgga caagttattt ctcagaacag 3000 

tgagtagttt ttcgttagga ggagaatctt taaaacggta tcttttcgtt gcgttaagct 3060 

gttagaaaaa ttaatgtctc atgtaaagta ttatgcactg ccttattatt attagacaag 3120 

40 

tgtgtggtgt gaatatgtct tcagactggc acttagactt cctataagtt cttgcc 3176 
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<210> 2 
<211> 976 
<212> PRT 

<213> Arabidopsis thaliana ERECTA allele 

5 

<400> 2 

Met Ala Leu Phe Arg Asp lie Val Leu Leu Gly Phe Leu Phe Cys Leu 
15 10 15 

10 

Ser Leu Val Ala Thr Val Thr Ser Glu Glu Gly Ala Thr Leu Leu Glu 
20 25 30 

15 

He Lys Lys Ser Phe Lys Asp Val Asn Asn Val Leu Tyr Asp Trp Thr 
35 40 45 



20 Thr Ser Pro Ser Ser Asp Tyr Cys Val Trp Arg Gly Val Ser Cys Glu 
50 55 60 



Asn Val Thr Phe Asn Val Val Ala Leu Asn Leu Ser Asp Leu Asn Leu 
25 65 70 75 80 



Asp Gly Glu He Ser Pro Ala He Gly Asp Leu Lys Ser Leu Leu Ser 
85 90 95 

30 

He Asp Leu Arg Gly Asn Arg Leu Ser Gly Gin He Pro Asp Glu He 
100 105 110 

35 

Gly Asp Cys Ser Ser Leu Gin Asn Leu Asp Leu Ser Phe Asn Glu Leu 
115 120 125 



40 Ser Gly Asp He Pro Phe Ser He Ser Lys Leu Lys Gin Leu Glu Gin 
130 135 140 
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Leu lie Leu Lys Asn Asn Gin Leu lie Gly Pro lie Pro Ser Thr Leu 
145 150 155 160 



5 Ser Gin lie Pro Asn Leu Lys lie Leu Asp Leu Ala Gin Asn Lys Leu 
165 170 175 



Ser Gly Glu He Pro Arg Leu He Tyr Trp Asn Glu Val Leu Gin Tyr 
10 180 185 190 



Leu Gly Leu Arg Gly Asn Asn Leu Val Gly Asn He Ser Pro Asp Leu 
195 200 205 

15 

Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Asn Asn Ser Leu 
210 215 220 

20 

Thr Gly Ser He Pro Glu Thr He Gly Asn Cys Thr Ala Phe Gin Val 
225 230 235 240 



25 Leu Asp Leu Ser Tyr Asn Gin Leu Thr Gly Glu He Pro Phe Asp He 

245 250 255 



Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn Gin Leu Ser 
30 260 265 270 



Gly Lys He Pro Ser Val He Gly Leu Met Gin Ala Leu Ala Val Leu 
275 280 285 

35 

Asp Leu Ser Gly Asn Leu Leu Ser Gly Ser He Pro Pro He Leu Gly 
290 295 300 

40 

Asn Leu Thr Phe Thr Glu Lys Leu Tyr Leu His Ser Asn Lys Leu Thr 
305 310 315 320 
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6 

Gly Ser He Pro Pro Glu Leu Gly Asn Met Ser Lys Leu His Tyr Leu 
325 330 335 

5 

Glu Leu Asn Asp Asn His Leu Thr Gly His He Pro Pro Glu Leu Gly 
340 345 350 

10 Lys Leu Thr Asp Leu Phe Asp Leu Asn Val Ala Asn Asn Asp Leu Glu 
355 360 365 



Gly Pro He Pro Asp His Leu Ser Ser Cys Thr Asn Leu Asn Ser Leu 
15 370 375 380 



Asn Val His Gly Asn Lys Phe Ser Gly Thr He Pro Arg Ala Phe Gin 
385 390 395 400 

20 

Lys Leu Glu Ser Met Thr Tyr Leu Asn Leu Ser Ser Asn Asn He Lys 
405 410 415 

25 

Gly Pro He Pro Val Glu Leu Ser Arg He Gly Asn Leu Asp Thr Leu 
420 425 430 



30 Asp Leu Ser Asn Asn Lys He Asn Gly He He Pro Ser Ser Leu Gly 
435 440 445 



Asp Leu Glu His Leu Leu Lys Met Asn Leu Ser Arg Asn His He Thr 
35 450 455 460 



Gly Val Val Pro Gly Asp Phe Gly Asn Leu Arg Ser He Met Glu He 
465 470 475 480 

40 



Asp Leu Ser Asn Asn Asp He Ser Gly Pro He Pro Glu Glu Leu Asn 
485 490 495 
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Gin Leu Gin Asn lie lie Leu Leu Arg Leu Glu Asn Asn Asn Leu Thr 
500 505 510 

5 

Gly Asn Val Gly Ser Leu Ala Asn Cys Leu Ser Leu Thr Val Leu Asn 
515 520 525 

10 

Val Ser His Asn Asn Leu Val Gly Asp lie Pro Lys Asn Asn Asn Phe 
530 535 540 



15 Ser Arg Phe Ser Pro Asp Ser Phe lie Gly Asn Pro Gly Leu Cys Gly 
545 550 555 560 



Ser Trp Leu Asn Ser Pro Cys His Asp Ser Arg Arg Thr Val Arg Val 
20 565 570 575 



Ser He Ser Arg Ala Ala He Leu Gly He Ala He Gly Gly Leu Val 
580 585 590 

25 

He Leu Leu Met Val Leu He Ala Ala Cys Arg Pro His Asn Pro Pro 
595 600 605 

30 

Pro Phe Leu Asp Gly Ser Leu Asp Lys Pro Val Thr Tyr Ser Thr Pro 
610 615 620 



35 Lys Leu Val He Leu His Met Asn Met Ala Leu His Val Tyr Glu Asp 
625 630 635 640 



He Met Arg Met Thr Glu Asn Leu Ser Glu Lys Tyr He He Gly His 
40 645 650 655 



Gly Ala Ser Ser Thr Val Tyr Lys Cys Val Leu Lys Asn Cys Lys Pro 
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8 

660 665 670 



Val Ala lie Lys Arg Leu Tyr Ser His Asn Pro Gin Ser Met Lys Gin 
5 675 680 685 



Phe Glu Thr Glu Leu Glu Met Leu Ser Ser lie Lys His Arg Asn Leu 
690 695 700 

10 

Val Ser Leu Gin Ala Tyr Ser Leu Ser His Leu Gly Ser Leu Leu Phe 
705 710 715 720 

15 

Tyr Asp Tyr Leu Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Pro 
725 730 735 



20 Thr Lys Lys Lys Thr Leu Asp Trp Asp Thr Arg Leu Lys He Ala Tyr 
740 745 750 



Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Ser Pro Arg 
25 755 760 765 



He He His Arg Asp Val Lys Ser Ser Asn He Leu Leu Asp Lys Asp 
770 775 780 

30 

Leu Glu Ala Arg Leu Thr Asp Phe Gly He Ala Lys Ser Leu Cys Val 
785 790 795 800 

35 

Ser Lys Ser His Thr Ser Thr Tyr Val Met Gly Thr He Gly Tyr He 
805 810 815 



40 Asp Pro Glu Tyr Ala Arg Thr Ser Arg Leu Thr Glu Lys Ser Asp Val 
820 825 830 
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Tyr Ser Tyr Gly lie Val Leu Leu Glu Leu Leu Thr Arg Arg Lys Ala 
835 840 845 



5 Val Asp Asp Glu Ser Asn Leu His His Leu lie Met Ser Lys Thr Gly 
850 855 860 



Asn Asn Glu Val Met Glu Met Ala Asp Pro Asp He Thr Ser Thr Cys 
10 865 870 875 880 



Lys Asp Leu Gly Val Val Lys Lys Val Phe Gin Leu Ala Leu Leu Cys 
885 890 895 

15 

Thr Lys Arg Gin Pro Asn Asp Arg Pro Thr Met His Gin Val Thr Arg 
900 905 910 

20 

Val Leu Gly Ser Phe Met Leu Ser Glu Gin Pro Pro Ala Ala Thr Asp 
915 920 925 



25 Thr Ser Ala Thr Leu Ala Gly Ser Cys Tyr Val Asp Glu Tyr Ala Asn 
930 935 940 



Leu Lys Thr Pro His Ser Val Asn Cys Ser Ser Met Ser Ala Ser Asp 
30 945 950 955 960 



Ala Gin Leu Phe Leu Arg Phe Gly Gin Val He Ser Gin Asn Ser Glu 
965 970 975 

35 

<210> 3 
<211> 3000 
<212> DNA 
40 <213> rice ERECTA 



<400> 3 

atggcggcgg cgagggcgcc gtggctgtgg tggtgggtgg tggtggttgt tggtgtggcg 
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gtggcggagg cggcctccgg aggaggagga gggggagatg gggaggggaa ggcgctgatg 
ggcgtgaagg ccggtttcgg gaacgcggcc aacgcgctcg tcgactggga cggcggcgcc 



gaccactgcg cgtggcgcgg cgtcacctgc gacaacgcct ccttcgccgt cctcgccctg 
aacttgtcaa atctaaacct aggaggtgag atctcgccgg ccatcggaga gctcaagaat 
10 ctacagttcg ttgatctcaa ggggaacaag ctcactggcc aaatcccaga tgagattggg 
gactgcatct ccttaaaata tttggatttg tctggcaact tgctgtatgg agacatcccc 
ttctccatct ccaagctcaa gcagcttgag gagctgattt tgaagaacaa ccagctcacg 



ggacccatcc cttccacatt gtcccaaatt ccaaatctca agacattgga cctggcacag 
aaccagctta caggcgatat cccaaggctc atatactgga atgaagttct gcaataccta 
20 ggtttgaggg gtaactcact gactggaact ttgtcacctg acatgtgcca actgactggc 
ctgtggtact ttgatgtaag gggaaacaat ctcacaggga ccattccaga gagcataggg 
aactgcacca gctttgagat tctggacatt tcgtataacc aaatctctgg agaaatacct 



tacaacatag gctttcttca agtagccaca ctgtcacttc aaggaaatag actgactggg 
aaaattccag atgtgattgg cctgatgcaa gctcttgctg ttctagacct gagtgagaac 
30 gagctggtag ggcccattcc ttctatactg ggcaatctat cctatactgg aaaactatat 
ttacatggga acaaacttac tggagtcata ccgccggagc ttgggaacat gagtaaactt 
agctacctac aactgaatga taatgaattg gtgggcacaa ttccagcaga gcttggcaaa 



cttgaagagc tttttgaact aaatcttgcc aacaacaatc ttcaaggtcc tattcctgca 
aacatcagtt cttgcactgc tctaaacaaa ttcaatgttt atggcaataa gctaaatggt 
40 tctattcctg ctggtttcca gaagttggag agtctgactt acttgaacct atcttcaaac 
aatttcaaag gcaatattcc ttctgagctt ggtcacatca tcaacttgga cacattggat 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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ctttcctaca atgaattctc tggaccagtt cctgctacca ttggtgatct agagcacctt 1380 

cttgaactga atttgagtaa gaaccatctt gatgggccag ttcctgctga gtttggaaac 1440 

5 ttgagaagcg tccaagtaat tgatatgtcc aacaacaact tatctggtag tctgcccgag 1500 

gaacttggac aacttcaaaa ccttgatagc ctgattctta acaacaacaa tttggttggg 1560 

gagatccctg ctcaattggc caactgcttc agcttaaata accttgcatt tcaggaattt 1620 

10 

gtcatacaac aatttatctg gacatgtccc gatggcaaag aacttctcga aattcccaat 1680 

ggaaagcatc ttctaatttc tgattgcaac cagtacataa atcataaatg cagcttcttg 1740 

15 ggtaatccat tactgcatgt ttactgccaa gattccagct gtggacactc tcatggacaa 1800 

agagttaata tttcaaagac agcaattgct tgcattatct taggctttat catattgctc 18 60 

tgcgttctgc tgttggctat atataaaaca aatcaaccac agccacttgt caaaggatcc 1920 

20 

gataagccag tgcaaggacc tccaaagcta gttgttctcc agatggacat ggctatccat 1980 

acttacgagg acatcatgag gctgacagag aatttgagcg agaaatacat cattggctat 2040 

25 ggcgcctcaa gcactgtcta caaatgtgaa ctcaagagcg gcaaggccat tgctgtcaag 2100 

cggctttaca gtcagtataa ccatagcctc cgagagtttg aaacagaact agagacaatt 2160 

ggcagcatac ggcacaggaa tcttgttagc ctccatggct tctcgctatc tccacatgga 2220 

30 

aacttgctct tctatgatta catggaaaat ggttccttgt gggatcttct ccacggtcca 2280 

tcaaagaaag tgaagctcaa ctgggacaca agactgagga tcgcggtcgg agctgcacaa 2340 

35 gggctggcct atctccacca tgactgcaac cctcgcataa tccacagaga tgtcaagtcc 24 00 

tccaacatcc tgctcgacga gaacttcgaa gcgcacctct cagatttcgg catagccaaa 24 60 

tgtgtcccct ctgccaagtc ccatgcctcc acttatgtgc taggaaccat cggctacatt 2520 

40 

gatccggagt atgccaggac ttccaggctc aatgagaaat ctgatgtgta cagcttcggc 2580 

atcgtccttc tggaattgct cacagggaag aaggccgtcg acaacgaatc gaacttgcat 2640 
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caattgatac tctccaaagc tgatgacaac acagtcatgg aggcagtgga ctcggaggtg 2700 
tcagtgacgt gcacggacat gggactggtc aggaaggcct tccagctcgc ccttctgtgc 2760 



accaagaggc acccttcaga ccggccgacc atgcacgagg ttgcaagggt gctgctctcc 2820 

ctgctgccgg cctccgccat gacaacgccc aagacggtgg actactcccg gttgctggcg 28 80 

10 tcgacgacga cggcggccga catgcgaggg cacgacgtga ccgacatcgg cgacaacagc 2940 

tcctccgacg agcagtggtt cgtcaggttc ggcgaggtca tatccaagca cacaatgtga 3000 

15 <210> 4 

<211> 999 

<212> PRT 

<213> rice ERECTA 

20 <400> 4 

Met Ala Ala Ala Arg Ala Pro Trp Leu Trp Trp Trp Val Val Val Val 
15 10 15 



5 



25 



Val Gly Val Ala Val Ala Glu Ala Ala Ser Gly Gly Gly Gly Gly Gly 
20 25 30 



30 Asp Gly Glu Gly Lys Ala Leu Met Gly Val Lys Ala Gly Phe Gly Asn 
35 40 45 



Ala Ala Asn Ala Leu Val Asp Trp Asp Gly Gly Ala Asp His Cys Ala 



35 



50 



55 



60 



Trp Arg Gly Val Thr Cys Asp Asn Ala Ser Phe Ala Val Leu Ala Leu 
65 70 75 80 



40 



Asn Leu Ser Asn Leu Asn Leu Gly Gly Glu lie Ser Pro Ala lie Gly 
85 90 95 
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Glu Leu Lys Asn Leu Gin Phe Val Asp Leu Lys Gly Asn Lys Leu Thr 
100 105 110 

5 

Gly Gin lie Pro Asp Glu He Gly Asp Cys He Ser Leu Lys Tyr Leu 
115 120 125 

10 

Asp Leu Ser Gly Asn Leu Leu Tyr Gly Asp He Pro Phe Ser He Ser 
130 135 140 



15 Lys Leu Lys Gin Leu Glu Glu Leu He Leu Lys Asn Asn Gin Leu Thr 
145 150 155 160 



Gly Pro He Pro Ser Thr Leu Ser Gin He Pro Asn Leu Lys Thr Leu 
20 165 170 175 



Asp Leu Ala Gin Asn Gin Leu Thr Gly Asp He Pro Arg Leu He Tyr 
180 185 190 

25 

Trp Asn Glu Val Leu Gin Tyr Leu Gly Leu Arg Gly Asn Ser Leu Thr 
195 200 205 

30 

Gly Thr Leu Ser Pro Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe 
210 215 220 



35 Asp Val Arg Gly Asn Asn Leu Thr Gly Thr He Pro Glu Ser He Gly 
225 230 235 240 



Asn Cys Thr Ser Phe Glu He Leu Asp He Ser Tyr Asn Gin He Ser 
40 245 250 255 



Gly Glu He Pro Tyr Asn He Gly Phe Leu Gin Val Ala Thr Leu Ser 
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260 265 270 



Leu Gin Gly Asn Arg Leu Thr Gly Lys lie Pro Asp Val lie Gly Leu 
5 275 280 285 



Met Gin Ala Leu Ala Val Leu Asp Leu Ser Glu Asn Glu Leu Val Gly 
290 295 300 

10 

Pro lie Pro Ser He Leu Gly Asn Leu Ser Tyr Thr Gly Lys Leu Tyr 
305 310 315 320 

15 

Leu His Gly Asn Lys Leu Thr Gly Val He Pro Pro Glu Leu Gly Asn 
325 330 335 



20 Met Ser Lys Leu Ser Tyr Leu Gin Leu Asn Asp Asn Glu Leu Val Gly 
340 345 350 



Thr He Pro Ala Glu Leu Gly Lys Leu Glu Glu Leu Phe Glu Leu Asn 
25 355 360 365 



Leu Ala Asn Asn Asn Leu Gin Gly Pro He Pro Ala Asn He Ser Ser 
370 375 380 

30 

Cys Thr Ala Leu Asn Lys Phe Asn Val Tyr Gly Asn Lys Leu Asn Gly 
385 390 395 400 

35 

Ser He Pro Ala Gly Phe Gin Lys Leu Glu Ser Leu Thr Tyr Leu Asn 
405 410 415 



40 Leu Ser Ser Asn Asn Phe Lys Gly Asn He Pro Ser Glu Leu Gly His 
420 425 430 



WO 2004/005555 PCT/AU2003/000854 



15 

He He Asn Leu Asp Thr Leu Asp Leu Ser Tyr Asn Glu Phe Ser Gly 
435 440 445 



5 Pro Val Pro Ala Thr He Gly Asp Leu Glu His Leu Leu Glu Leu Asn 
450 455 460 



Leu Ser Lys Asn His Leu Asp Gly Pro Val Pro Ala Glu Phe Gly Asn 
10 465 470 475 480 



Leu Arg Ser Val Gin Val He Asp Met Ser Asn Asn Asn Leu Ser Gly 
485 490 495 

15 

Ser Leu Pro Glu Glu Leu Gly Gin Leu Gin Asn Leu Asp Ser Leu He 
500 505 510 

20 

Leu Asn Asn Asn Asn Leu Val Gly Glu He Pro Ala Gin Leu Ala Asn 
515 520 525 



25 Cys Phe Ser Leu Asn Asn Leu Ala Phe Gin Glu Phe Val He Gin Gin 
530 535 540 



Phe He Trp Thr Cys Pro Asp Gly Lys Glu Leu Leu Glu He Pro Asn 
30 545 550 555 560 



Gly Lys His Leu Leu He Ser Asp Cys Asn Gin Tyr He Asn His Lys 
565 570 575 

35 

Cys Ser Phe Leu Gly Asn Pro Leu Leu His Val Tyr Cys Gin Asp Ser 
580 585 590 



40 

Ser Cys Gly His Ser His Gly Gin Arg Val Asn He Ser Lys Thr Ala 
595 600 605 
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He Ala Cys He He Leu Gly Phe He He Leu Leu Cys Val Leu Leu 
610 615 620 

5 

Leu Ala He Tyr Lys Thr Asn Gin Pro Gin Pro Leu Val Lys Gly Ser 
625 630 635 640 



10 Asp Lys Pro Val Gin Gly Pro Pro Lys Leu Val Val Leu Gin Met Asp 

645 650 655 



Met Ala He His Thr Tyr Glu Asp He Met Arg Leu Thr Glu Asn Leu 
15 660 665 670 



Ser Glu Lys Tyr He He Gly Tyr Gly Ala Ser Ser Thr Val Tyr Lys 
675 680 685 

20 

Cys Glu Leu Lys Ser Gly Lys Ala He Ala Val Lys Arg Leu Tyr Ser 
690 695 700 

25 

Gin Tyr Asn His Ser Leu Arg Glu Phe Glu Thr Glu Leu Glu Thr He 
705 710 715 720 



30 Gly Ser He Arg His Arg Asn Leu Val Ser Leu His Gly Phe Ser Leu 

725 730 735 



Ser Pro His Gly Asn Leu Leu Phe Tyr Asp Tyr Met Glu Asn Gly Ser 
35 740 745 750 



Leu Trp Asp Leu Leu His Gly Pro Ser Lys Lys Val Lys Leu Asn Trp 
755 760 765 

40 



Asp Thr Arg Leu Arg He Ala Val Gly Ala Ala Gin Gly Leu Ala Tyr 
770 775 780 
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Leu His His Asp Cys Asn Pro Arg lie lie His Arg Asp Val Lys Ser 
785 790 795 800 

5 

Ser Asn He Leu Leu Asp Glu Asn Phe Glu Ala His Leu Ser Asp Phe 
805 810 815 

10 

Gly He Ala Lys Cys Val Pro Ser Ala Lys Ser His Ala Ser Thr Tyr 
820 825 830 



15 Val Leu Gly Thr He Gly Tyr He Asp Pro Glu Tyr Ala Arg Thr Ser 
835 840 845 



Arg Leu Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly He Val Leu Leu 
20 850 855 860 



Glu Leu Leu Thr Gly Lys Lys Ala Val Asp Asn Glu Ser Asn Leu His 
865 870 875 880 

25 

Gin Leu He Leu Ser Lys Ala Asp Asp Asn Thr Val Met Glu Ala Val 
885 890 895 

30 

Asp Ser Glu Val Ser Val Thr Cys Thr Asp Met Gly Leu Val Arg Lys 
900 905 910 

35 Ala Phe Gin Leu Ala Leu Leu Cys Thr Lys Arg His Pro Ser Asp Arg 
915 920 925 



Pro Thr Met His Glu Val Ala Arg Val Leu Leu Ser Leu Leu Pro Ala 
40 930 935 940 



Ser Ala Met Thr Thr Pro Lys Thr Val Asp Tyr Ser Arg Leu Leu Ala 
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945 950 955 960 



Ser Thr Thr Thr Ala Ala Asp Met Arg Gly His Asp Val Thr Asp lie 
5 965 970 975 



Gly Asp Asn Ser Ser Ser Asp Glu Gin Trp Phe Val Arg Phe Gly Glu 
980 985 990 

10 

Val lie Ser Lys His Thr Met 
995 

15 

<210> 5 
<211> 2766 
<212> DNA 

<213> sorghum E RECTA 

20 

<400> 5 

atgacgacga cggccgcccg tgctctcgtc gccctcctcc tcgtcgccgt cgccgtcgcc 60 
gacgatgggg cgacgctggt ggagatcaag aagtccttcc gcaacgtcgg caacgtactg 120 

25 

tacgattggg ccggcgacga ctactgctcc tggcgcggcg tcctgtgcga caacgtcaca 180 
ttcgccgtcg ctgcgctcaa cctctctggc ctcaaccttg agggcgagat ctctccagcc 240 
30 gtcggcagcc tcaagagcct cgtctccatc gatctgaagt caaatgggct atccgggcag 300 
atccctgatg agattggtga ttgttcatca cttaggacgc tggacttttc tttcaacaac 3 60 
ttggatggcg acataccatt ttctatatca aagctgaagc acctggagaa cttgatattg 420 

35 

aagaacaacc agctgattgg tgcgatccca tcaacattgt cacagctccc aaatttgaag 480 
attttggatt tggcacaaaa caaactgact ggggagatac caaggcttat ctactggaat 540 
40 gaggttcttc aatatcttga tgtgaagaac aatagcttga ccggggtgat accagacacc 600 
attgggaact gtacaagttt tcaagtcttg gatttgtctt acaaccgctt tactggacca 660 
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atcccattca acattggttt cctacaagtg gctacactat ccttgcaagg gaacaagttc 720 

accggtccaa ttccttcagt aattggtctt atgcaggctc tcgctgttct agatctgagt 780 

5 tacaaccaat tatctggtcc tataccatca atactaggca acttgacata cactgagaag 840 

ctgtacatcc aaggcaataa gttaactggg tcgataccac cagagttagg aaatatgtca 900 

acacttcatt acctagaact gaacgataat caacttactg ggtcaattcc accagagctt 960 

10 

ggaaggctaa caggcttgtt tgacctgaac cttgcgaata accacctgga aggaccaatt 1020 

cctgacaacc taagttcatg tgtgaatctc aatagcttca atgcttatgg caacaagtta 1080 

15 aatgggacca ttcctcgttc gttgcggaaa cttgaaagca tgacctattt aaatctgtca 1140 

tcaaacttca taagtggctc tattcctatt gagttatcaa ggatcaacaa tttggacacg 1200 

ctggatttat cctgtaacat gatgactggt ccaattccat catcaattgg cagcctagag 1260 

20 

catctattga gacttaactt gagcaagaat ggtctagttg gattcatccc cgcggagttt 1320 

ggtaatttga ggagtgtcat ggagattgat ttatcctata atcaccttgg tggcctgatt 1380 

25 cctcaagaac ttgaaatgct gcaaaacctg atgttgctaa atgtgtcgta caataatttg 1440 

gctggtgttg tccctgctga caacaacttc acacggtttt cacctgacag ctttttaggt 1500 

aatcctggac tctgtggata ctggcttggt tcgtcgtgtc gttccactgg ccaccacgag 1560 

30 

aaaccgccta tctcaaaggc tgccataatt ggtgttgctg tgggtggact tgttatcctc 1620 

ttgatgatct tagtagctgt ttgcaggcca catcgtccac ctgcttttaa agatgtcact 1680 

35 gtaagcaagc cagtgagaaa tgctcccccc aagctggtga tccttcatat gaacatggcc 1740 

cttcatgtat acgatgacat aatgaggatg actgagaact tgagtgagaa atacatcatt 1800 

ggatacgggg cgtcaagtac agtttataaa tgtgtcctaa agaattgcaa accggtggca 1860 

40 

ataaaaaagc tgtatgccca ctacccacag agccttaagg aatttgaaac tgagcttgag 1920 

actgttggta gcatcaagca ccggaatcta gtcagccttc aagggtactc attatcacct 1980 
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gttgggaacc tcctctttta tgattatatg gaatgtggca gcttatggga tgttttacat 2040 
gaaggttcat ccaagaagaa aaaacttgac tgggagactc gcctacggat tgctcttggt 2100 

5 

gcagctcaag gccttgctta ccttcaccat gactgcagtc cacggataat tcatcgggat 2160 
gtaaaatcaa agaatatact ccttgacaaa gattatgagg cccatcttac agactttgga 2220 
10 attgctaaga gcttatgtgt ctcaaaaact cacacatcaa cctatgtcat gggaactatt 2280 
ggctacattg atcctgagta cgcccgcact tcccgtctca acgaaaagtc tgatgtctac 2340 
aggctatggc attgttctgc tggagctgct gactggcaag aagccagtgg acaacgaatc 2400 

15 

ctatcgaaga cggcaagcaa cgaggtcatg gataccgtgg accctgacat cggggacacc 24 60 
tgcaaggacc tcggcgaggt gaagaagctg ttccagctgg cgctcctttg caccaagcgg 2520 
20 caaccctcgg accgaccgac gatgcacgag gtggtgcgcg tcctggactg cctggtgaac 25 80 
ccggacccgc cgccaaagcc gtcggcgcac cagctgccgc agccgtcgcc agccgtgcca 2640 
agctacatca acgagtacgt cagcctgcgg ggcaccggcg ctctctcctg cgccaactcg 2700 

25 

accagcacct cggacgccga gctgttcctc aagttcggcg aggccatctc gcagaacatg 2760 
gagtag 2766 

30 

<210> 6 
<211> 921 
<212> PRT 

<213> Sorghum ERECTA 

35 

<400> 6 

Met Thr Thr Thr Ala Ala Arg Ala Leu Val Ala Leu Leu Leu Val Ala 
15 10 15 

40 



Val Ala Val Ala Asp Asp Gly Ala thr Leu Val Glu lie Lys Lys Ser 
20 25 30 
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Phe Arg Asn Val Gly Asn Val Leu Tyr Asp Trp Ala Gly Asp Asp Tyr 
35 40 45 

5 

Cys Ser Trp Arg Gly Val Leu Cys Asp Asn Val Thr Phe Ala Val Ala 
50 55 60 

10 

Ala Leu Asn Leu Ser Gly Leu Asn Leu Glu Gly Glu lie Ser Pro Ala 
65 70 75 80 



15 Val Gly Ser Leu Lys Ser Leu Val Ser lie Asp Leu Lys Ser Asn Gly 

85 90 95 



Leu Ser Gly Gin He Pro Asp Glu He Gly Asp Cys Ser Ser Leu Arg 
20 100 105 110 



Thr Leu Asp Phe Ser Phe Asn Asn Leu Asp Gly Asp He Pro Phe Ser 
115 120 125 

25 

He Ser Lys Leu Lys His Leu Glu Asn Leu He Leu Lys Asn Asn Gin 
130 135 140 

30 

Leu He Gly Ala He Pro Ser Thr Leu Ser Gin Leu Pro Asn Leu Lys 
145 150 155 160 



35 He Leu Asp Leu Ala Gin Asn Lys Leu Thr Gly Glu He Pro Arg Leu 

165 170 175 



He Tyr Trp Asn Glu Val Leu Gin Tyr Leu Asp Val Lys Asn Asn Ser 
40 180 185 190 



Leu Thr Gly Val He Pro Asp Thr He Gly Asn Cys Thr Ser Phe Gin 
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195 200 205 



Val Leu Asp Leu Ser Tyr Asn Arg Phe Thr Gly Pro lie Pro Phe Asn 
5 210 215 220 



lie Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn Lys Phe 
225 230 235 240 

10 

Thr Gly Pro lie Pro Ser Val He Gly Leu Met Gin Ala Leu Ala Val 
245 250 255 

15 

Leu Asp Leu Ser Tyr Asn Gin Leu Ser Gly Pro He Pro Ser He Leu 
260 265 270 



20 Gly Asn Leu Thr Tyr Thr Glu Lys Leu Tyr He Gin Gly Asn Lys Leu 
275 280 285 



Thr Gly Ser He Pro Pro Glu Leu Gly Asn Met Ser Thr Leu His Tyr 
25 290 295 300 



Leu Glu Leu Asn Asp Asn Gin Leu Thr Gly Ser He Pro Pro Glu Leu 
305 310 315 320 

30 

Gly Arg Leu Thr Gly Leu Phe Asp Leu Asn Leu Ala Asn Asn His Leu 
325 330 335 

35 

Glu Gly Pro He Pro Asp Asn Leu Ser Ser Cys Val Asn Leu Asn Ser 
340 345 350 



40 Phe Asn Ala Tyr Gly Asn Lys Leu Asn Gly Thr He Pro Arg Ser Leu 
355 360 365 
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Arg Lys Leu Glu Ser Met Thr Tyr Leu Asn Leu Ser Ser Asn Phe lie 
370 375 380 



5 Ser Gly Ser lie Pro lie Glu Leu Ser Arg lie Asn Asn Leu Asp Thr 
385 390 395 400 



Leu Asp Leu Ser Cys Asn Met Met Thr Gly Pro He Pro Ser Ser He 
10 405 410 415 



Gly Ser Leu Glu His Leu Leu Arg Leu Asn Leu Ser Lys Asn Gly Leu 
420 425 430 

15 

Val Gly Phe He Pro Ala Glu Phe Gly Asn Leu Arg Ser Val Met Glu 
435 440 445 

20 

He Asp Leu Ser Tyr Asn His Leu Gly Gly Leu He Pro Gin Glu Leu 
450 455 460 



25 Glu Met Leu Gin Asn Leu Met Leu Leu Asn Val Ser Tyr Asn Asn Leu 
465 470 475 480 



Ala Gly Val Val Pro Ala Asp Asn Asn Phe Thr Arg Phe Ser Pro Asp 
30 485 490 495 



Ser Phe Leu Gly Asn Pro Gly Leu Cys Gly Tyr Trp Leu Gly Ser Ser 
500 505 510 

35 

Cys Arg Ser Thr Gly His His Glu Lys Pro Pro He Ser Lys Ala Ala 
515 520 525 



40 

He He Gly Val Ala Val Gly Gly Leu Val He Leu Leu Met He Leu 
530 535 540 
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Val Ala Val Cys Arg Pro His Arg Pro Pro Ala Phe Lys Asp Val Thr 
545 550 555 560 

5 

Val Ser Lys Pro Val Arg Asn Ala Pro Pro Lys Leu Val lie Leu His 
565 570 575 

10 Met Asn Met Ala Leu His Val Tyr Asp Asp He Met Arg Met Thr Glu 
580 585 590 



Asn Leu Ser Glu Lys Tyr He He Gly Tyr Gly Ala Ser Ser Thr Val 
15 595 600 605 



Tyr Lys Cys Val Leu Lys Asn Cys Lys Pro Val Ala He Lys Lys Leu 
610 615 620 

20 

Tyr Ala His Tyr Pro Gin Ser Leu Lys Glu Phe Glu Thr Glu Leu Glu 
625 630 635 640 

25 

Thr Val Gly Ser He Lys His Arg Asn Leu Val Ser Leu Gin Gly Tyr 
645 650 655 



30 Ser Leu Ser Pro Val Gly Asn Leu Leu Phe Tyr Asp Tyr Met Glu Cys 
660 665 670 



Gly Ser Leu Trp Asp Val Leu His Glu Gly Ser Ser Lys Lys Lys Lys 
35 675 680 685 



Leu Asp Trp Glu Thr Arg Leu Arg He Ala Leu Gly Ala Ala Gin Gly 
690 695 700 

40 



Leu Ala Tyr Leu His His Asp Cys Ser Pro Arg He He His Arg Asp 
705 710 715 720 
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Val Lys Ser Lys Asn He Leu Leu Asp Lys Asp Tyr Glu Ala His Leu 
725 730 735 

5 

Thr Asp Phe Gly He Ala Lys Ser Leu Cys Val Ser Lys Thr His Thr 
740 745 750 

10 

Ser Thr Tyr Val Met Gly Thr He Gly Tyr He Asp Pro Glu Tyr Ala 
755 760 765 



15 Arg Thr Ser Arg Leu Asn Glu Lys Ser Asp Val Tyr Arg Leu Trp His 
770 775 780 



Cys Ser Ala Gly Ala Ala Asp Trp Gin Glu Ala Ser Gly Gin Arg He 
20 785 790 795 800 



Leu Ser Lys Thr Ala Ser Asn Glu Val Met Asp Thr Val Asp Pro Asp 
805 810 815 

25 

He Gly Asp Thr Cys Lys Asp Leu Gly Glu Val Lys Lys Leu Phe Gin 
820 825 830 

30 

Leu Ala Leu Leu Cys Thr Lys Arg Gin Pro Ser Asp Arg Pro Thr Met 
835 840 845 



35 His Glu Val Val Arg Val Leu Asp Cys Leu Val Asn Pro Asp Pro Pro 
850 855 860 



Pro Lys Pro Ser Ala His Gin Leu Pro Gin Pro Ser Pro Ala Val Pro 
40 865 870 875 880 



Ser Tyr He Asn Glu Tyr Val Ser Leu Arg Gly Thr Gly Ala Leu Ser 
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885 890 895 

Cys Ala Asn Ser Thr Ser Thr Ser Asp Ala Glu Leu Phe Leu Lys Phe 
5 900 905 910 



Gly Glu Ala He Ser Gin Asn Met Glu 
915 920 

10 

<210> 7 
<211> 2751 
<212> DNA 

15 <213> Arabidopsis thaliana ERECTA homolog 
<400> 7 

atggcgataa aggcttcatt cagcaacgtg gcgaatatgc ttcttgattg ggacgatgtt 60 
20 cataaccacg acttttgttc ttggagaggt gtcttctgtg ataacgttag cctcaatgtt 120 
gtctctctta atctgtcaaa cctgaatctt ggtggagaga tatcatctgc ccttggagat 180 
ttgatgaatc tgcaatcaat agacttgcaa ggaaataaat tgggtggtca aattccagat 240 

25 

gagattggaa actgtgtttc tcttgcttat gtggatttct ccaccaattt gttgtttgga 300 
gacataccgt tttcaatctc taaactcaaa cagctgacct taactcagat tccaaacctt 360 
30 aagacccttg acctcgcaag aaaccagctt actggtgaga taccaaggtt actctactgg 420 
aatgaagttt tacagtatct cggtttacgt gggaatatgt taactgggac attgtctcct 480 
gatatgtgtc agctgacggg tctgtggtac tttgatgtga gaggcaacaa ccttactgga 540 

35 

actatcccag agagcattgg caattgcaca agctttgaga tcttggatgt atcttataat 600 
cagattaccg gagttatacc ctacaatatt ggtttcctcc aagtagctac tctgtcactt 660 
40 caaggaaaca agttgactgg cagaattccg gaagtgattg gtctgatgca ggctcttgct 720 
gtattggatt tgagtgacaa tgaattaact gggcctattc caccaatact tgggaatctg 780 
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tcattcactg gaaaactgta tctccatggc aacaagctca ctggacaaat cccacccgag 840 

ctaggcaata tgtcacgact cagctatttg caactaaatg ataatgaact agtgggaaag 900 

5 atcccacctg agcttgggaa gctggaacaa ttgttcgaac tgaatcttgc gaacaacaat 960 

cttgtagggc tgattccatc taacattagt tcctgtgctg ccttgaatca attcaatgtt 1020 

catgggaact tcttgagtgg agctgtacca cttgaattcc ggaatcttgg aagcttgact 1080 

10 

tatctaaatc tttcctcaaa cagtttcaag ggcaaaatac ctgctgagct tggccatatc 1140 

atcaatcttg atacattgga tctgtctggc aacaatttct caggctcaat tccattaaca 1200 

15 cttggtgatc ttgagcatct tctcatctta aacttgagca gaaatcatct gaatggcaca 1260 

ttgcctgcag aattcgggaa cctccgaagc attcagatca tcgatgtgtc atttaatttt 1320 

cttgccggtg ttattccaac tgaacttggc cagttgcaga acataaactc tctgatactg 1380 

20 

aacaacaaca agattcatgg gaaaatccct gatcagctaa ctaactgctt cagtcttgcc 1440 

aatctgaaca tctccttcaa taatctttct ggaataatcc cacctatgaa gaactttaca 1500 

25 cgtttttccc cggccagctt ctttggaaat ccatttctct gcgggaactg ggttggatca 15 60 

atctgtggcc catctttacc taagtcacaa gtattcacca gagttgccgt gatttgtatg 1620 

gttctcggtt tcatcactct catatgcatg atattcattg cggtttacaa gtcaaagcag 1680 

30 

cagaaaccag tcttgaaagg ctcttcaaaa caacctgaag ggtcaacgaa gctggtgatt 1740 

cttcacatgg acatggctat tcacacgttt gatgatatca tgagagttac agaaaacctc 1800 

35 gatgagaaat acatcattgg atacggtgct tctagcacag tttacaagtg cacctccaaa 18 60 

acttcccgac ctattgccat taagcgaatc tacaatcagt atcccagcaa cttccgcgag 1920 

tttgaaacag agctcgagac cattgggagc atcagacaca gaaacatagt aagcttgcac 1980 

40 

ggatacgcct tatctccctt tggcaacctc ctcttctacg actacatgga aaatggctct 2040 

ctttgggatc ttctccatgg gcctgggaag aaggtgaagc ttgactggga aacaaggctg 2100 
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aagatagctg ttggagctgc gcaaggactt gcatatcttc accatgactg cacacctagg 2160 

ataatccatc gagacatcaa gtcatcaaac atactccttg atgggaattt cgaagcgcgt 2220 

5 

ttgtcagatt ttgggattgc caagagcata ccagccacca aaacttatgc ttcaacctat 2280 

gttcttggaa ccattggata tattgaccca gagtatgctc gaacttcgcg tctgaacgag 2340 

10 aagtctgata tctacagttt cggtattgtc cttcttgagc ttctaaccgg caagaaggct 2400 

gtggataacg aggccaactt gcatcaaatg attctatcaa aggcggatga taacacagta 24 60 

atggaagctg ttgatgcaga ggtctcagtg acttgcatgg actcaggaca catcaagaaa 2520 



acatttcagc tagctctctt gtgcaccaag cgaaatcctt tggagagacc caccatgcag 2580 
gaggtctcta gggttctgct ctcacttgtc ccgtctccac ctccaaagaa gttaccgtcg 2640 
20 cctgcaaaag tacaggaagg ggaagaacgg cgtgagagcc actcttcaga tacaacaacc 2700 



25 <210> 8 

<211> 932 
<212> PRT 

<213> Arabidopsis thaliana ERECTA homolog 
30 <400> 8 

Met Ala lie Lys Ala Ser Phe Ser Asn Val Ala Asn Met Leu Leu Asp 
15 10 15 



15 



ccacagtggt 



ttgttcagtt ccgtgaagat atctccaaaa gtagcttata a 



2751 



35 



Trp Asp Asp Val His Asn His Asp Phe Cys Ser Trp Arg Gly Val Phe 
20 25 30 



40 



Cys Asp Asn Val Ser Leu Asn Val Val Ser Leu Asn Leu Ser Asn Leu 
35 40 45 
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Asn Leu Gly Gly Glu lie Ser Ser Ala Leu Gly Asp Leu Met Asn Leu 
50 55 60 



5 Gin Ser lie Asp Leu Gin Gly Asn Lys Leu Gly Gly Gin lie Pro Asp 
65 70 75 80 



Glu lie Gly Asn Cys Val Ser Leu Ala Tyr Val Asp Phe Ser Thr Asn 
10 85 90 95 



Leu Leu Phe Gly Asp He Pro Phe Ser He Ser Lys Leu Lys Gin Leu 
100 105 110 

15 

Glu Phe Leu Asn Leu Lys Asn Asn Gin Leu Thr Gly Pro He Pro Ala 
115 120 125 

20 

Thr Leu Thr Gin He Pro Asn Leu Lys Thr Leu Asp Leu Ala Arg Asn 
130 135 140 



25 Gin Leu Thr Gly Glu He Pro Arg Leu Leu Tyr Trp Asn Glu Val Leu 
145 150 155 160 



Gin Tyr Leu Gly Leu Arg Gly Asn Met Leu Thr Gly Thr Leu Ser Pro 
30 165 170 175 



Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Gly Asn 
180 185 190 

35 

Asn Leu Thr Gly Thr He Pro Glu Ser He Gly Asn Cys Thr Ser Phe 
195 200 205 



40 

Glu He Leu Asp Val Ser Tyr Asn Gin He Thr Gly Val He Pro Tyr 
210 215 220 
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atcccattca acattggttt cctacaagtg 
accggtccaa ttccttcagt aattggtctt 
5 tacaaccaat tatctggtcc tataccatca 
ctgtacatcc aaggcaataa gttaactggg 
acacttcatt acctagaact gaacgataat 

10 

ggaaggctaa caggcttgtt tgacctgaac 
cctgacaacc taagttcatg tgtgaatctc 
15 aatgggacca ttcctcgttc gttgcggaaa 
tcaaacttca taagtggctc tattcctatt 
ctggatttat cctgtaacat gatgactggt 

20 

catctattga gacttaactt gagcaagaat 
ggtaatttga ggagtgtcat ggagattgat 
25 cctcaagaac ttgaaatgct gcaaaacctg 
gctggtgttg tccctgctga caacaacttc 
aatcctggac tctgtggata ctggcttggt 

30 

aaaccgccta tctcaaaggc tgccataatt 
ttgatgatct tagtagctgt ttgcaggcca 
35 gtaagcaagc cagtgagaaa tgctcccccc 
cttcatgtat acgatgacat aatgaggatg 
ggatacgggg cgtcaagtac agtttataaa 

40 

ataaaaaagc tgtatgccca ctacccacag 
actgttggta gcatcaagca ccggaatcta 
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gctacactat ccttgcaagg gaacaagttc 720 

atgcaggctc tcgctgttct agatctgagt 780 

atactaggca acttgacata cactgagaag 840 

tcgataccac cagagttagg aaatatgtca 900 

caacttactg ggtcaattcc accagagctt 960 

cttgcgaata accacctgga aggaccaatt 1020 

aatagcttca atgcttatgg caacaagtta 1080 

cttgaaagca tgacctattt aaatctgtca 1140 

gagttatcaa ggatcaacaa tttggacacg 1200 

ccaattccat catcaattgg cagcctagag 1260 

ggtctagttg gattcatccc cgcggagttt 1320 

ttatcctata atcaccttgg tggcctgatt 1380 

atgttgctaa atgtgtcgta caataatttg 1440 

acacggtttt cacctgacag ctttttaggt 1500 

tcgtcgtgtc gttccactgg ccaccacgag 1560 

ggtgttgctg tgggtggact tgttatcctc 1620 

catcgtccac ctgcttttaa agatgtcact 1680 

aagctggtga tccttcatat gaacatggcc 1740 

actgagaact tgagtgagaa atacatcatt 1800 

tgtgtcctaa agaattgcaa accggtggca 18 60 

agccttaagg aatttgaaac tgagcttgag 1920 

gtcagccttc aagggtactc attatcacct 1980 
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gttgggaacc tcctctttta tgattatatg gaatgtggca gcttatggga tgttttacat 2040 
gaaggttcat ccaagaagaa aaaacttgac tgggagactc gcctacggat tgctcttggt 2100 

5 

gcagctcaag gccttgctta ccttcaccat gactgcagtc cacggataat tcatcgggat 2160 
gtaaaatcaa agaatatact ccttgacaaa gattatgagg cccatcttac agactttgga 2220 
10 attgctaaga gcttatgtgt ctcaaaaact cacacatcaa cctatgtcat gggaactatt 2280 
ggctacattg atcctgagta cgcccgcact tcccgtctca acgaaaagtc tgatgtctac 2340 
aggctatggc attgttctgc tggagctgct gactggcaag aagccagtgg acaacgaatc 2400 

15 

ctatcgaaga cggcaagcaa cgaggtcatg gataccgtgg accctgacat cggggacacc 24 60 
tgcaaggacc tcggcgaggt gaagaagctg ttccagctgg cgctcctttg caccaagcgg 2520 
20 caaccctcgg accgaccgac gatgcacgag gtggtgcgcg tcctggactg cctggtgaac 2580 
ccggacccgc cgccaaagcc gtcggcgcac cagctgccgc agccgtcgcc agccgtgcca 2640 
agctacatca acgagtacgt cagcctgcgg ggcaccggcg ctctctcctg cgccaactcg 2700 

25 

accagcacct cggacgccga gctgttcctc aagttcggcg aggccatctc gcagaacatg 2760 
gagtag 2766 

30 

<210> 6 
<211> 921 
<212> PRT 

<213> Sorghum E RECTA 

35 

<400> 6 

Met Thr Thr Thr Ala Ala Arg Ala Leu Val Ala Leu Leu Leu Val Ala 
1 5 10 15 

40 



Val Ala Val Ala Asp Asp Gl-y Ala Thr Leu Val Glu lie Lys Lys Ser 
20 25 30 
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Phe Arg Asn Val Gly Asn Val Leu Tyr Asp Trp Ala Gly Asp Asp Tyr 
35 40 45 

5 

Cys Ser Trp Arg Gly Val Leu Cys Asp Asn Val Thr Phe Ala Val Ala 
50 55 60 

10 

Ala Leu Asn Leu Ser Gly Leu Asn Leu Glu Gly Glu lie Ser Pro Ala 
65 70 75 80 



15 Val Gly Ser Leu Lys Ser Leu Val Ser lie Asp Leu Lys Ser Asn Gly 

85 90 95 



Leu Ser Gly Gin lie Pro Asp Glu lie Gly Asp Cys Ser Ser Leu Arg 
20 100 105 110 



Thr Leu Asp Phe Ser Phe Asn Asn Leu Asp Gly Asp lie Pro Phe Ser 
115 120 125 

25 

He Ser Lys Leu Lys His Leu Glu Asn Leu He Leu Lys Asn Asn Gin 
130 135 140 

30 

Leu He Gly Ala He Pro Ser Thr Leu Ser Gin Leu Pro Asn Leu Lys 
145 150 155 160 



35 He Leu Asp Leu Ala Gin Asn Lys Leu Thr Gly Glu He Pro Arg Leu 

165 170 175 



He Tyr Trp Asn Glu Val Leu Gin Tyr Leu Asp Val Lys Asn Asn Ser 
40 180 185 190 



Leu Thr Gly Val He Pro Asp Thr He Gly Asn Cys Thr Ser Phe Gin 
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195 200 205 



Val Leu Asp Leu Ser Tyr Asn Arg Phe Thr Gly Pro He Pro Phe Asn 
5 210 215 220 



He Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn Lys Phe 
225 230 235 240 

10 

Thr Gly Pro He Pro Ser Val He Gly Leu Met Gin Ala Leu Ala Val 
245 250 255 

15 

Leu Asp Leu Ser Tyr Asn Gin Leu Ser Gly Pro He Pro Ser He Leu 
260 265 270 

20 Gly Asn Leu Thr Tyr Thr Glu Lys Leu Tyr He Gin Gly Asn Lys Leu 
275 280 285 



Thr Gly Ser He Pro Pro Glu Leu Gly Asn Met Ser Thr Leu His Tyr 
25 290 295 300 



Leu Glu Leu Asn Asp Asn Gin Leu Thr Gly Ser He Pro Pro Glu Leu 
305 310 315 320 

30 

Gly Arg Leu Thr Gly Leu Phe Asp Leu Asn Leu Ala Asn Asn His Leu 
325 330 335 

35 

Glu Gly Pro He Pro Asp Asn Leu Ser Ser Cys Val Asn Leu Asn Ser 
340 345 350 



40 Phe Asn Ala Tyr Gly Asn Lys Leu Asn Gly Thr He Pro Arg Ser Leu 
355 360 365 
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Arg Lys Leu Glu Ser Met Thr Tyr Leu Asn Leu Ser Ser Asn Phe lie 
370 375 380 



5 Ser Gly Ser lie Pro lie Glu Leu Ser Arg lie Asn Asn Leu Asp Thr 
385 390 395 400 



Leu Asp Leu Ser Cys Asn Met Met Thr Gly Pro lie Pro Ser Ser lie 
10 405 410 415 



Gly Ser Leu Glu His Leu Leu Arg Leu Asn Leu Ser Lys Asn Gly Leu 
420 425 430 

15 

Val Gly Phe lie Pro Ala Glu Phe Gly Asn Leu Arg Ser Val Met Glu 
435 440 445 

20 

lie Asp Leu Ser Tyr Asn His Leu Gly Gly Leu lie Pro Gin Glu Leu 
450 455 460 



25 Glu Met Leu Gin Asn Leu Met Leu Leu Asn Val Ser Tyr Asn Asn Leu 
465 470 475 480 



Ala Gly Val Val Pro Ala Asp Asn Asn Phe Thr Arg Phe Ser Pro Asp 
30 485 490 495 



Ser Phe Leu Gly Asn Pro Gly Leu Cys Gly Tyr Trp Leu Gly Ser Ser 
500 505 510 

35 

Cys Arg Ser Thr Gly His His Glu Lys Pro Pro lie Ser Lys Ala Ala 
515 520 525 



40 

He He Gly Val Ala Val Gly Gly Leu Val He Leu Leu Met He Leu 
530 535 540 
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Asn lie Gly Phe Leu Gin Val Ala Thr Leu Ser LeuJSln Gly Asn Lys 
225 230 235 240 

5 

Leu Thr Gly Arg lie Pro Glu Val lie Gly Leu Met Gin Ala Leu Ala 
245 250 255 



10 Val Leu Asp Leu Ser Asp Asn Glu Leu Thr Gly Pro lie Pro Pro lie 
260 265 270 



Leu Gly Asn Leu Ser Phe Thr Gly Lys Leu Tyr Leu His Gly Asn Lys 
15 275 280 285 



Leu Thr Gly Gin lie Pro Pro Glu Leu Gly Asn Met Ser Arg Leu Ser 
290 295 300 

20 

Tyr Leu Gin Leu Asn Asp Asn Glu Leu Val Gly Lys lie Pro Pro Glu 
305 310 315 320 

25 

Leu Gly Lys Leu Glu Gin Leu Phe Glu Leu Asn Leu Ala Asn Asn Asn 
325 330 335 



30 Leu Val Gly Leu lie Pro Ser Asn He Ser Ser Cys Ala Ala Leu Asn 
340 345 350 



Gin Phe Asn Val His Gly Asn Phe Leu Ser Gly Ala Val Pro Leu Glu 
35 355 360 365 



Phe Arg Asn Leu Gly Ser Leu Thr Tyr Leu Asn Leu Ser Ser Asn Ser 
370 375 380 

40 



Phe Lys Gly Lys He Pro Ala Glu Leu Gly His He He Asn Leu Asp 
385 390 395 400 
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Thr Leu Asp Leu Ser Gly Asn Asn Phe Ser Gly Ser He Pro Leu Thx 
405 410 415 

5 

Leu Gly Asp Leu Glu His Leu Leu He Leu Asn Leu Ser Arg Asn His 
420 425 430 

10 

Leu Asn Gly Thr Leu Pro Ala Glu Phe Gly Asn Leu Arg Ser He Gin 
435 440 445 



15 He He Asp Val Ser Phe Asn Phe Leu Ala Gly Val He Pro Thr Glu 
450 455 460 



Leu Gly Gin Leu Gin Asn He Asn Ser Leu He Leu Asn Asn Asn Lys 
20 465 470 475 480 



He His Gly Lys He Pro Asp Gin Leu Thr Asn Cys Phe Ser Leu Ala 
485 490 495 

25 

Asn Leu Asn He Ser Phe Asn Asn Leu Ser Gly He He Pro Pro Met 
500 505 510 

30 

Lys Asn Phe Thr Arg Phe Ser Pro Ala Ser Phe Phe Gly Asn Pro Phe 
515 520 525 



35 Leu Cys Gly Asn Trp Val Gly Ser He Cys Gly Pro Ser Leu Pro Lys 
530 535 540 



Ser Gin Val Phe Thr Arg Val Ala Val He Cys Met Val Leu Gly Phe 
40 545 550 555 560 



He Thr Leu He Cys Met He Phe He Ala Val Tyr Lys Ser Lys Gin 
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565 570 575 



Gin Lys Pro Val Leu Lys Gly Ser Ser Lys Gin Pro Glu Gly Ser Thr 
5 580 585 590 



Lys Leu Val He Leu His Met Asp Met Ala He His Thr Phe Asp Asp 
595 600 605 

10 

He Met Arg Val Thr Glu Asn Leu Asp Glu Lys Tyr He He Gly Tyr 
610 615 620 

15 

Gly Ala Ser Ser Thr Val Tyr Lys Cys Thr Ser Lys Thr Ser Arg Pro 
625 630 635 640 



20 He Ala He Lys Arg He Tyr Asn Gin Tyr Pro Ser Asn Phe Arg Glu 

645 650 655 



Phe Glu Thr Glu Leu Glu Thr He Gly Ser He Arg His Arg Asn He 
25 660 665 670 



Val Ser Leu His Gly Tyr Ala Leu Ser Pro Phe Gly Asn Leu Leu Phe 
675 680 685 

30 

Tyr Asp Tyr Met Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Pro 
690 695 700 

35 

Gly Lys Lys Val Lys Leu Asp Trp Glu Thr Arg Leu Lys He Ala Val 
705 710 715 720 



40 Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Thr Pro Arg 

725 730 735 
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He He His Arg Asp He Lys Ser Ser Asn He Leu Leu Asp Gly Asn 
740 745 750 



5 Phe Glu Ala Arg Leu Ser Asp Phe Gly He Ala Lys Ser He Pro Ala 
755 760 765 



Thr Lys Thr Tyr Ala Ser Thr Tyr Val Leu Gly Thr He Gly Tyr He 
10 770 775 780 



Asp Pro Glu Tyr Ala Arg Thr Ser Arg Leu Asn Glu Lys Ser Asp He 
785 790 795 800 

15 

Tyr Ser Phe Gly He Val Leu Leu Glu Leu Leu Thr Gly Lys Lys Ala 
805 810 815 

20 

Val Asp Asn Glu Ala Asn Leu His Gin Met He Leu Ser Lys Ala Asp 
820 825 830 



25 Asp Asn Thr Val Met Glu Ala Val Asp Ala Glu Val Ser Val Thr Cys 
835 . 840 845 



Met Asp Ser Gly His He Lys Lys Thr Phe Gin Leu Ala Leu Leu Cys 
30 850 855 860 



Thr Lys Arg Asn Pro Leu Glu Arg Pro Thr Met Gin Glu Val Ser Arg 
865 870 875 880 

35 

Val Leu Leu Ser Leu Val Pro Ser Pro Pro Pro Lys Lys Leu Pro Ser 
885 890 895 



40 

Pro Ala Lys Val Gin Glu Gly Glu Glu Arg Arg Glu Ser His Ser Ser 
900 905 910 
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Asp Thx Thr Thr Pro Gin Trp Phe Val Gin Phe Arg Glu Asp lie Ser 
915 920 925 

5 

Lys Ser Ser Leu 
930 

10 <210> 9 

<211> 2901 
<212> DMA 

<213> Arabidopsis thaliana ERECTA homolog 
15 <400> 9 

atgaaggaga agatgcagcg aatggtttta tctttagcaa tggtgggttt tatggttttt 60 
ggtgttgctt cggctatgaa caacgaaggg aaagctctga tggcgataaa aggctctttc 120 
20 agcaacttag tgaatatgct tttggattgg gacgatgttc acaacagtga cttgtgttct 180 
tggcgaggtg ttttctgcga caacgttagc tactccgttg tctctctgaa tttgtccagt 240 
ctgaatcttg gaggggagat atctccagct attggagacc tacggaattt gcaatcaata 300 

25 

gacttgcaag gtaataaact agcaggtcaa attccagatg agattggaaa ctgtgcttct 360 
cttgtttatc tggatttgtc cgagaatctg ttatatggag acataccttt ctcaatctct 420 
30 aaactcaagc agcttgaaac tctgaatctg aagaacaatc agctcacagg tcctgtacca 480 
gcaaccttaa cccagattcc aaaccttaag agacttgatc ttgctggcaa tcatctaacg 540 
ggtgagatat cgagattgct ttactggaat gaagttttgc agtatcttgg attacgaggg 600 

35 

aatatgttga ctggaacgtt atcttctgat atgtgtcagc taaccggttt gtggtacttt 660 
gatgtgagag gaaataatct aactggaacc atcccggaga gcatcggaaa ttgcacaagc 720 
40 tttcaaatcc tggacatatc ttataatcag ataacaggag agattcctta caatatcggc 780 
ttcctccaag ttgctactct gtcacttcaa ggaaacagat tgacgggtag aattccagaa 840 
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gttattggtc taatgcaggc tcttgctgtt 
cctatcccac cgatacttgg caatctctca 
5 atgctcactg gtccaatccc ctctgagctt 
ctaaacgaca ataaactagt gggaactatt 
tttgaactga atcttgccaa caaccgttta 

10 

tgtgcagcct tgaatcaatt caatgttcat 
gcgtttcgca atctcgggag cttgacttat 
15 aaaataccag ttgagcttgg acatataatc 
aacttctcag ggtctatacc attaacgctt 
cttagcagaa accatcttag tggacaatta 

20 

cagatgattg atgtatcatt caatctgctc 
ttgcagaatt taaactcttt aatattgaac 
25 cagcttacga actgcttcac tcttgtcaat 
atagtcccac caatgaaaaa cttctcacgt 
tatctttgtg gaaactgggt tggatctatt 

30 

tccagaggtg ctttgatctg cattgttctt 
cttgcagttt acaaatcaat gcagcagaag 
35 gaagggttaa ccaagctagt gattctccac 
atcatgagag tgactgagaa tcttaacgaa 
acggtataca aatgtgcatt aaaaagttcc 

40 

cagtatccgc ataacttgcg ggaatttgag 
cacagaaaca tagtcagctt gcatggatat 
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35 

ttggatttga gtgacaatga gcttgttggt 900 
tttaccggaa agttgtatct ccatggcaat 960 

gggaatatgt cacgtctcag ctatttgcag 1020 

ccacctgagc ttggaaagct ggagcaattg 1080 

gtagggccca taccatccaa cattagttca 1140 

gggaacctct tgagtggatc tattccactg 1200 

ctgaatcttt cgtcgaacaa tttcaaggga 1260 

aatcttgaca aactagatct gtctggcaat 1320 

ggcgatcttg aacaccttct catattaaat 1380 

cctgcagagt ttgggaacct tcgaagcatt 1440 

tccggagtta ttccaactga acttggccaa 1500 

aacaacaagc ttcatgggaa aattccagat 15 60 

ctgaatgtct ccttcaacaa tctctccggg 1620 

tttgctccag ccagctttgt tggaaatcca 1680 

tgtggtcctt taccgaaatc tcgagtattc 1740 

ggcgtcatca ctctcctatg tatgattttc 1800 

aagattctac aaggctcctc aaaacaagct 1860 

atggacatgg caattcatac atttgatgat 1920 

aagtttataa ttggatatgg tgcttctagc 1980 

cgacctattg ccattaagcg actctacaat 2040 

acagaacttg agaccattgg gagcattagg 2100 

gccttgtctc ctactggcaa ccttcttttc 2160 
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tatgactaca tggaaaatgg atcactttgg gaccttcttc atgggtcatt gaagaaagtg 
aagcttgatt gggagacaag gttgaagata gcggttggag ctgcacaagg actagcctat 

5 

cttcaccacg attgtactcc tcgaatcatt caccgtgaca tcaagtcatc gaacatactt 
cttgatgaga atttcgaagc acatttatct gatttcggga ttgctaagag cataccagct 
10 agcaaaaccc atgcctcgac ttatgttttg ggaacaattg gttatataga cccagagtat 
gctcgtactt cacgaatcaa tgagaaatcc gatatataca gcttcggtat tgttcttctt 
gagcttctca ctgggaagaa agcagtggat aacgaagcta acttgcatca actgatattg 

15 

tcaaaggctg atgataatac tgtgatggaa gcagttgatc cagaggttac tgtgacttgt 
atggacttgg gacatatcag gaagacattt cagctggctc tcttatgcac aaagcgaaac 
20 cctttagaga gacccacaat gcttgaagtc tctagggttc tgctctctct tgtcccatct 
ctgcaagtag caaagaagct accttctctt gatcactcaa ccaaaaagct gcagcaagag 
aatgaagtta ggaatcctga tgcagaagca tctcaatggt ttgttcagtt ccgtgaagtc 



atctccaaaa gtagcatata a 

<210> 10 
30 <211> 966 
<212> PRT 

<213> Arabidopsis thaliana ERECTA homolog 
<400> 10 

35 

Met Lys Glu Lys Met Gin Arg Met Val Leu Ser Leu Ala Met Val Gly 
IS 10 15 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2901 



40 



Phe Met Val Phe Gly Val Ala Ser Ala Met Asn Asn Glu Gly Lys Ala 
20 25 30 
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Leu Met Ala lie Lys Gly Ser Phe Ser Asn Leu Val Asn Met Leu Leu 
35 40 45 



5 Asp Trp Asp Asp Val His Asn Ser Asp Leu Cys Ser Trp Arg Gly Val 
50 55 60 



Phe Cys Asp Asn Val Ser Tyr Ser Val Val Ser Leu Asn Leu Ser Ser 
10 65 70 75 80 



Leu Asn Leu Gly Gly Glu lie Ser Pro Ala lie Gly Asp Leu Arg Asn 
85 90 95 

15 

Leu Gin Ser He Asp Leu Gin Gly Asn Lys Leu Ala Gly Gin He Pro 
100 105 HO 

20 

Asp Glu He Gly Asn Cys Ala Ser Leu Val Tyr Leu Asp Leu Ser Glu 
115 120 125 



25 Asn Leu Leu Tyr Gly Asp He Pro Phe Ser He Ser Lys Leu Lys Gin 
130 135 140 



Leu Glu Thr Leu Asn Leu Lys Asn Asn Gin Leu Thr Gly Pro Val Pro 
30 145 150 155 160 



Ala Thr Leu Thr Gin He Pro Asn Leu Lys Arg Leu Asp Leu Ala Gly 
165 170 175 

35 

Asn His Leu Thr Gly Glu He Ser Arg Leu Leu Tyr Trp Asn Glu Val 
180 185 190 

40 

Leu Gin Tyr Leu Gly Leu Arg Gly Asn Met Leu Thr Gly Thr Leu Ser 
195 200 205 
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Ser Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Gly 
210 215 220 

5 

Asn Asn Leu Thr Gly Thr lie Pro Glu Ser lie Gly Asn Cys Thr Ser 
225 230 235 240 



10 Phe Gin He Leu Asp He Ser Tyr Asn Gin He Thr Gly Glu He Pro 

245 250 255 



Tyr Asn He Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn 
15 260 265 270 



Arg Leu Thr Gly Arg He Pro Glu Val He Gly Leu Met Gin Ala Leu 
275 280 285 

20 

Ala Val Leu Asp Leu Ser Asp Asn Glu Leu Val Gly Pro He Pro Pro 
290 295 300 

25 

He Leu Gly Asn Leu Ser Phe Thr Gly Lys Leu Tyr Leu His Gly Asn 
305 310 315 320 



30 Met Leu Thr Gly Pro He Pro Ser Glu Leu Gly Asn Met Ser Arg Leu 

325 330 335 



Ser Tyr Leu Gin Leu Asn Asp Asn Lys Leu Val Gly Thr He Pro Pro 
35 340 345 350 



Glu Leu Gly Lys Leu Glu Gin Leu Phe Glu Leu Asn Leu Ala Asn Asn 
355 360 365 

40 



Arg Leu Val Gly Pro He Pro Ser Asn He Ser Ser Cys Ala Ala Leu 
370 375 380 
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Asn Gin Phe Asn Val His Gly Asn Leu Leu Ser Gly Ser He Pro Leu 
385 390 395 400 

5 

Ala Phe Arg Asn Leu Gly Ser Leu Thr Tyr Leu Asn Leu Ser Ser Asn 
405 410 415 

10 

Asn Phe Lys Gly Lys He Pro Val Glu Leu Gly His He He Asn Leu 
420 425 '430 



15 Asp Lys Leu Asp Leu Ser Gly Asn Asn Phe Ser Gly Ser He Pro Leu 
435 440 445 



Thr Leu Gly Asp Leu Glu His Leu Leu He Leu Asn Leu Ser Arg Asn 
20 450 455 460 



His Leu Ser Gly Gin Leu Pro Ala Glu Phe Gly Asn Leu Arg Ser He 
465 470 475 480 

25 

Gin Met He Asp Val Ser Phe Asn Leu Leu Ser Gly Val He Pro Thr 
485 490 495 

30 

Glu Leu Gly Gin Leu Gin Asn Leu Asn Ser Leu He Leu Asn Asn Asn 
500 505 510 

35 Lys Leu His Gly Lys He Pro Asp Gin Leu Thr Asn Cys Phe Thr Leu 
515 520 525 



Val Asn Leu Asn Val Ser Phe Asn Asn Leu Ser Gly He Val Pro Pro 
40 530 535 540 



Met Lys Asn Phe Ser Arg Phe Ala Pro Ala Ser Phe Val Gly Asn Pro 
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545 550 555 560 



Tyr Leu Cys Gly Asn Trp Val Gly Ser lie Cys Gly Pro Leu Pro Lys 
5 565 570 575 



Ser Arg Val Phe Ser Arg Gly Ala Leu lie Cys lie Val Leu Gly Val 
580 585 590 

10 

lie Thr Leu Leu Cys Met lie Phe Leu Ala Val Tyr Lys Ser Met Gin 
595 600 605 

15 

Gin Lys Lys lie Leu Gin Gly Ser Ser Lys Gin Ala Glu Gly Leu Thr 
610 615 620 



20 Lys Leu Val lie Leu His Met Asp Met Ala lie His Thr Phe Asp Asp 
625 630 635 640 



lie Met Arg Val Thr Glu Asn Leu Asn Glu Lys Phe lie lie Gly Tyr 
25 645 650 655 



Gly Ala Ser Ser Thr Val Tyr Lys Cys Ala Leu Lys Ser Ser Arg Pro 
660 665 670 

30 

He Ala He Lys Arg Leu Tyr Asn Gin Tyr Pro His Asn Leu Arg Glu 
675 680 685 

35 

Phe Glu Thr Glu Leu Glu Thr He Gly Ser He Arg His Arg Asn He 
690 695 700 



40 Val Ser Leu His Gly Tyr Ala Leu Ser Pro Thr Gly Asn Leu Leu Phe 
705 710 715 720 
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Tyr Asp Tyr Met Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Ser 
725 730 735 



5 Leu Lys Lys Val Lys Leu Asp Trp Glu Thr Arg Leu Lys lie Ala Val 
740 745 750 



Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Thr Pro Arg 
10 755 760 765 



He He His Arg Asp He Lys Ser Ser Asn He Leu Leu Asp Glu Asn 
770 775 780 

15 

Phe Glu Ala His Leu Ser Asp Phe Gly He Ala Lys Ser He Pro Ala 
785 790 795 800 

20 

Ser Lys Thr His Ala Ser Thr Tyr Val Leu Gly Thr He Gly Tyr He 
805 810 815 



25 Asp Pro Glu Tyr Ala Arg Thr Ser Arg He Asn Glu Lys Ser Asp He 
820 825 830 



Tyr Ser Phe Gly He Val Leu Leu Glu Leu Leu Thr Gly Lys Lys Ala 
30 835 840 845 



Val Asp Asn Glu Ala Asn Leu His Gin Leu He Leu Ser Lys Ala Asp 
850 855 860 

35 

Asp Asn Thr Val Met Glu Ala Val Asp Pro Glu Val Thr Val Thr Cys 
865 870 875 880 



40 

Met Asp Leu Gly His He Arg Lys Thr Phe Gin Leu Ala Leu Leu Cys 
885 890 895 
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Thr Lys Arg Asn Pro Leu Glu Arg Pro Thr Met Leu Glu Val Ser Arg 
900 905 910 

5 

Val Leu Leu Ser Leu Val Pro Ser Leu Gin Val Ala Lys Lys Leu Pro 
915 920 925 

10 Ser Leu Asp His Ser Thr Lys Lys Leu Gin Gin Glu Asn Glu Val Arg 
930 935 940 



Asn Pro Asp Ala Glu Ala Ser Gin Trp Phe Val Gin Phe Arg Glu Val 
15 945 950 955 960 



lie Ser Lys Ser Ser lie 
965 

20 

<210> 11 
<211> 636 
<212> DNA 
25 <213> partial wheat E RECTA 

<400> 11 

atgaattctg caatacctag gcttgagggg taactcactg actggaacct tgtcacctga 60 

30 catgtgccaa ctcactggcc tgtggtactt tgatgtgagg ggcaacaatc taacaggaac 120 

aattccacag agcataggga actgcactag ctttgagatt ctggacattt catataacaa 180 

aatctctgga gaaatacctt acaacatagg tttccttcaa gtagctacac tgtcacttca 240 

35 

aggaaataga ctgactggga aaattccaga agtgattggc ctcatgcaag ctcttgctgt 300 

tcttgatctg agcgaaaacg aactagtagg ggccattcct ccgatactcg gcaacctgtc 360 

40 ctacactggc aaactatatt tgcatggcaa taaacttact ggtgaagtac ccccggaact 420 

tgggaacatg actaaactta gctacctgca actgaatgac aatgaattag tgggcgcaat 4 80 
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tccagctgag cttgggaaac ttgaagagct attcgaatta aatcttgcca acaacaatct 540 

tgagggtcct attcctacaa acatcagttc ttgcactgca ctaaacaaat tcaatgttta 600 

5 cggcaataga ttgaacggtt ctatccctgc tggttt 636 

<210> 12 

<211> 466 

10 <212> DNA 

<213> partial wheat E RECTA 

<400> 12 

ttcaatgttt atggcaatag attgaacggt tctatccctg ctggtttcca gaatttggag 60 

15 

agtttgacta acttgaattt atcctcaaac aattttaaag gccatatccc atctgaactt 120 
ggtcatatca tcaatttgga cacactggat ctttcctaca atgaactctc tggaccagtt 180 
20 cctgctacta ttggtgatct tgagcatctt cttcaactaa atttgagcaa aaaccatctt 240 
agcgggtcag tgcctgctga gttcggaaac ttgagaagca tccaagtaat tgatttatcc 300 
aacaacgcca tgtctggtta tctccctgaa gaactaggcc aacttcagaa ccttgatagt 3 60 

25 

ttgattctta acaacaatat tttggtcgga gagatccctg ctcagttggc taactgcttc 420 
agcttaaaca tcttgaactt gtcacataac aacttttctg gacatg 466 

30 

<210> 13 
<211> 372 
<212> DNA 

<213> partial wheat ERECTA 

35 

<220> 

<221> misc_ feature 
<222> (62).. (62) 
<223> not determined 

40 



<400> 13 

ttgtcagcct tctggcttct cactctctcc caatggaaac ctgctcttct acgattacat 



60 
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gngaaaacgg ttccttgtgg gatcttctcc atggtccatc aaagaaagtg aagcttgact 
gggacacccg actgaggatc gcggtcggcg cggcacaagg gctggcctat ctgcaccatg 

5 

actgcaatct gcggatagtc cacagggacg tcaagtcctc caacatcctg ctcgacgagc 
actttgaagc gcatctctcg gacttcggca tcgccaaatg cgtcccggca gccaagaccc 
10 atgcgtccac atatgtgcta ggaaccatcg gctacatcga tccagagtac gcccggacgt 
cgaggttgaa eg 

15 <210> 14 
<211> 357 
<212> DNA 

<213> partial wheat ERECTA 
20 <400> 14 

ttgactaact tgaatttatc ctcaaacaat tttaaaggee atatcccatc tgaacttggt 
catatcatca atttggacac actggatctt tcctacaatg aactctctgg accagttcct 
25 gctactattg gtgatcttga gcatcttctt caactaaatt tgagcaaaaa ccatcttagc 
gggtcagtgc ctgctgagtt eggaaacttg agaagcatcc aagtaattga tttatccaac 
aacgecatgt ctggttatct ccctgaagaa ctaggccaac ttcagaacct tgatagtttg 



attcttaaca acaatatttt ggteggggag atccctgctc agttggctaa ctgette 



<210> 15 

35 <211> 314 

<212> DNA 

<213> partial wheat ERECTA 



120 
180 
240 
300 
360 
372 



60 
120 
180 
240 
300 
357 



<400> 15 

40 cacactggat ctttcctaca atgaattctc tggaccagac cctgctacta ttggtgatct 



tgagcatgtt cttcagatta aatttgagca aaaaccatct tactgggeca atgcctgctg 



60 
120 
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agttctgaaa cttgagaagc atccaagtaa ttgatttatc caacaacgcc atgtctggtt 
atctccctga agaactacgc caacttcaga atcttgatag tttgatgctt aacaacaata 
5 ctttggttgg ggagatccct gctcatctgg ctaactgctt caacttaaac atcttgaact 
tgccatataa caac 

10 <210> 16 
<211> 549 
<212> DNA 

<213> partial wheat ERECTA 
15 <400> 16 

catcatcggc tacggcgctt caagtaccgt gtataaatgt gtgctcaaga gtggcaaggc 
cattgctgtg aagcggctct acagccaata caaccatggc gcccgtgagt ttgagacaga 
20 gctggagaca gtcggtagca tccggcacag gaatcttgtc agccttcatg gcttctcact 
ctctccaaat ggaaacctgc tcttctacga ttacatggaa aatggttcct tgtgggatct 
tctccacggt ccatcaaaga aggtgaaact tgactgggac acccgactga gaatcgccgt 



cggtgcggca caagggctgg catatcttca ccatgactgc aaccctcgga tagtccacag 
ggacgtcaag tcctccaaca tcctgctcga cgagcacttt gaagcgcatc tctcggactt 
30 cggcatcgcc aaatgcgtcc cagctgccaa gacccacgcg tccacctatg tgctaggaac 
catcggctac atcgatccag agtacgcccg gacgtcccag ctgaacgaga aatctgatgt 
gtacagctt 



<210> 17 

<211> 615 

<212> DNA 

40 <213> partial wheat ERECTA 

<400> 17 

ccacgcgtcg atcatcaatt gggacacacg ggatctttcc tacaatgaat tctccgggcc 



180 
240 
300 
314 



60 
120 
180 
240 
300 
360 
420 
480 
540 
549 



35 
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agttcctgct actattggtg atctggagca tcttcttcaa ctaaatttga gcaaaaacca 120 

tcttagtggg tctgtgcctg ctgagttcgg aaacttgaga agcatccaag taattgattt 180 

5 

atccaacaac gccatttctg gttatctccc tgaagaacta ggccaacttc agaaccttga 240 

tagtttgatt cttaacaaca atactttggt tggggagatc cctgctcagt tggctaactg 300 

10 cttcagctta aacatcttga acttgtcata taacaacttt tctggacatg tcccattcgc 360 

taagaacttc tcaaagttcc ccggggaaag cttcttggga aatccgatgc tgagcgttca 420 

ctgcaaagac tccagctgtg gcaactctca tggatcaaaa gtgaatactc ggacagcgat 480 

15 

tgcttgcatc atctcgggct tcgtcatatt gctctgtgtt ctgctattgg gcaatatata 540 

aaacaaagcg accacagcca cctatcaaag catctgataa accagggcaa ggacctccaa 600 

20 agatagtact cctcc 615 

<210> 18 

<211> 719 

25 <212> DMA 

<213> partial wheat E RECTA 

<400> 18 

cgttcactgc aaagactcca gctgtggcaa ctctcatgga tcaaaagtga atattcggac 60 

30 

ggcgattgct tgcatcatct cgggcttcgt catactgcta tgtgttctgc tattggcaat 120 

atataaaaca aagcgaccac agccacctat caaagcatct gataaaccag tgcaaggacc 180 

35 tccaaagata gtactcctcc aaatggacat ggctatccat acctatgatg atattatgag 240 

gctgacagag aatttgagcg agaaatacat catcggctac ggcgcttcaa gtaccgtgta 300 

taaatgtgtg ctcaagagtg gcaaggccat tgctgtgaag cggctctaca gccaatacaa 360 

40 

ccatggcgcc cgtgagtttg agacagagct ggagacagtc ggtagcatcc ggcacaggaa 420 

tcttgtcagc cttcatggct tctcactctc tccaaatgga aacctgctct tctacgatta 480 
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catggaaaat ggttccttgt gggatcttct ccacggtcca tcaaagaagg tgaaacttga 540 
ctgggacacc cgactgagaa tcgccgtcgg tgcggcacaa gggctggcat atcttcacca 600 

5 

tgactgcaac cctcggatag tccacaggga cgtcaagtcc tccaacatcc tgctcgacga 660 
gcactttgaa gcgcatctct cggacttcgg catcgcccaa tgcgtcccca gctgccaag 719 

10 

<210> 19 

<211> 1346 

<212> DNA 

<213> wheat E RECTA 

15 

<4O0> 19 

ttcaatgttt atggcaatag attgaacggt tctatccctg ctggtttcca gaatttggag 60 
agtttgacta acttgaattt atcctcaaac aattttaaag gccatatccc atctgaactt 120 

20 

ggtcatatca tcaatttgga cacactggat ctttcctaca atgaactctc tggaccagtt 180 
cctgctacta ttggtgatct tgagcatctt cttcaactaa atttgagcaa aaaccatctt 240 
25 agcgggtcag tgcctgctga gttcggaaac ttgagaagca tccaagtaat tgatttatcc 300 
aacaacgcca tgtctggtta tctccctgaa gaactaggcc aacttcagaa ccttgatagt 360 
ttgattctta acaacaatat tttggtcggg gagatccctg ctcagttggc taactgcttc 420 

30 

agcttaaaca tcttgaactt gtcacataac aacttttctg gacatgtccc attcgctaag 480 
aacttctcaa agttccccgg ggaaagcttc ttgggaaatc cgatgctgag cgttcactgc 540 
35 aaagactcca gctgtggcaa ctctcatgga tcaaaagtga atactcggac agcgattgct 600 
tgcatcatct cgggcttcgt catactgcta tgtgttctgc tattggcaat atataaaaca 660 
aagcgaccac agccacctat caaagcatct gataaaccag ggcaaggacc tccaaagata 720 

40 

gtactcctcc aaatggacat ggctatccat acctatgatg atattatgag gctgacagag 780 
aatttgagcg agaaatacat catcggctac ggcgcttcaa gtaccgtgta taaatgtgtg 840 
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ctcaagagtg gcaaggccat tgctgtgaag cggctctaca gccaatacaa ccatggcgcc 900 
cgtgagtttg agacagagct ggagacagtc ggtagcatcc ggcacaggaa tcttgtcagc 960 

5 

cttcatggct tctcactctc tccaaatgga aacctgctct tctacgatta catggaaaat 1020 
ggttccttgt gggatcttct ccacggtcca tcaaagaagg tgaaacttga ctgggacacc 1080 
10 cgactgagaa tcgccgtcgg tgcggcacaa gggctggcat atcttcacca tgactgcaac 1140 
cctcggatag tccacaggga cgtcaagtcc tccaacatcc tgctcgacga gcactttgaa 1200 
gcgcatctct cggacttcgg catcgccaaa tgcgtcccag ctgccaagac ccacgcgtcc 12 60 

15 

acatatgtgc taggaaccat cggctacatc gatccagagt acgcccggac gtcccagctg 1320 
aacgagaaat ctgatgtgta cagctt 134 6 

20 

<210> 20 

<211> 448 

<212> PRT 

<213> wheat ERECTA 

25 

<400> 20 

Phe Asn Val Tyr Gly Asn Arg Leu Asn Gly Ser lie Pro Ala Gly Phe 
15 10 15 

30 

Gin Asn Leu Glu Ser Leu Thr Asn Leu Asn Leu Ser Ser Asn Asn Phe 
20 25 30 

35 

Lys Gly His lie Pro Ser Glu Leu Gly His lie lie Asn Leu Asp Thr 
35 40 45 



40 Leu Asp Leu Ser Tyr Asn Glu Leu Ser Gly Pro Val Pro Ala Thr lie 
50 55 60 
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Gly Asp Leu Glu His Leu Leu Gin Leu Asn Leu Ser Lys Asn His Leu 
65 70 75 80 



5 Ser Gly Ser Val Pro Ala Glu Phe Gly Asn Leu Arg Ser lie Gin Val 
85 90 95 



He Asp Leu Ser Asn Asn Ala Met Ser Gly Tyr Leu Pro Glu Glu Leu 
10 100 105 110 



Gly Gin Leu Gin Asn Leu Asp Ser Leu He Leu Asn Asn Asn He Leu 
115 120 125 

15 

Val Gly Glu He Pro Ala Gin Leu Ala Asn Cys Phe Ser Leu Asn He 
130 135 140 

20 

Leu Asn Leu Ser His Asn Asn Phe Ser Gly His Val Pro Phe Ala Lys 
145 150 155 160 



25 Asn Phe Ser Lys Phe Pro Gly Glu Ser Phe Leu Gly Asn Pro Met Leu 

165 170 175 



Ser Val His Cys Lys Asp Ser Ser Cys Gly Asn Ser His Gly Ser Lys 
30 180 185 190 



Val Asn Thr Arg Thr Ala He Ala Cys He He Ser Gly Phe Val He 
195 200 205 

35 

Leu Leu Cys Val Leu Leu Leu Ala He Tyr Lys Thr Lys Arg Pro Gin 
210 215 220 



40 

Pro Pro He Lys Ala Ser Asp Lys Pro Gly Gin Gly Pro Pro Lys He 
225 230 235 240 
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Val Leu Leu Gin Met Asp Met Ala lie His Thr Tyr Asp Asp He Met 
245 250 255 

5 

Arg Leu Thr Glu Asn Leu Ser Glu Lys Tyr He He Gly Tyr Gly Ala 
260 265 270 

10 Ser Ser Thr Val Tyr Lys Cys Val Leu Lys Ser Gly Lys Ala He Ala 
275 280 285 



Val Lys Arg Leu Tyr Ser Gin Tyr Asn His Gly Ala Arg Glu Phe Glu 
15 290 295 300 



Thr Glu Leu Glu Thr Val Gly Ser He Arg His Arg Asn Leu Val Ser 
305 310 315 320 

20 

Leu His Gly Phe Ser Leu Ser Pro Asn Gly Asn Leu Leu Phe Tyr Asp 
325 330 335 

25 

Tyr Met Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Pro Ser Lys 
340 345 350 

30 Lys Val Lys Leu Asp Trp Asp Thr Arg Leu Arg He Ala Val Gly Ala 
355 360 365 



Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Asn Pro Arg He Val 
35 370 375 380 



His Arg Asp Val Lys Ser Ser Asn He Leu Leu Asp Glu His Phe Glu 
385 390 395 400 

40 

Ala His Leu Ser Asp Phe Gly He Ala Lys Cys Val Pro Ala Ala Lys 
405 410 415 
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Thr His Ala Ser Thr Tyr Val Leu Gly Thr lie Gly Tyr lie Asp Pro 



420 



425 



430 



Glu Tyr Ala Arg Thr Ser Gin Leu Asn Glu Lys Ser Asp Val Tyr Ser 



435 



440 



445 



10 



15 



20 



25 



30 



35 



<210> 21 
<211> 1273 
<212> DNA 

<213> partial maize ERECTA 
<400> 21 

cctggactct gtggatattg gcttggttct tcatgtcgtt ccactggcca ccgagacaaa 60 

ccgccaatct caaaggctgc cataattggt gttgctgtgg gtggacttgt tatcctcctg 120 

atgatcttag tagctgtatg caggccacac catccacctg cttttaaaga tgccactgta 180 

agcaagccag tgagcaatgg tccacccaag ctggtgatcc ttcatatgaa catggctctt 240 

catgtctttg atgatataat gaggatgact gagaacttga gtgagaaata catcattgga 300 

tacggggcat caagtacagt ttataaatgt gttctaaaga attgcaaacc agtggcaata 360 

aaaaagctgt atgcccacta ccctgcagag ccttaaggaa tttgaaactg agctcgagac 420 

tgttggtagc atcaaacacc ggaatctagt cagccttgcc aagggtactc gttgtcacct 4 80 

gttgggaacc tcctctttta tgattatatg gagagtggca gcttatggga tgttttacat 540 

gaaggctcat ccaagaagaa caaacttgac tgggtgactc gcctacggat cgctcttggt 600 

gcagctcaag gcctcgctta ccttcaccat gactgcagcc cacgaataat tcaccgggac 660 

gtaaaatcaa agaatatact cctcgacaaa gattatgagg cccatcttac agacttcggc 720 

atcgctaaga gcttatgtgt ctcgaagact cacacgtcaa cctacgtcat gggcactatt 780 

ggttacattg atcccgagta cgcccgcacc tcccgcctca acgagaagtc tgatgtctac 840 
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agctacggca tcgttctgct ggagctgctg accggcaaga agccagtgga caacgagtgc 900 

aatctccatc acttgatcct atcgaagacg gcgagcaacg aggtcatgga gacggtggac 960 

5 

cccgacgtgg gagacacctg caaggacctg ggcgaggtga agaagctgtt ccagctggcg 1020 

ctcctctgca ccaagcggca gccctcggac cggccgacga tgcacgaggt ggtgcgcgtc 1080 

10 cttgactgcc tggtgaaccc ggagccgccg ccgcagccgc agcagcagca gcagaagggc 1140 

gcacgcgcac caccagctgc cgccgcagcc gtcgccgccg gcctacgtcg acgagtacgt 1200 

cagcctgcgg ggcactggcg ccctctcctg cgccaactcg tccagcacct cggacgccga 12 60 

15 

gctgttcctc aag 1273 



<210> 22 
20 <211> 100 
<212> DNA 

<213> partial maize ERECTA 
<400> 22 

25 cacaaaatgt cagtcaaact actccccctg caatcggcct cactcaaggc gcctcaccga 60 
acgtctacgt cttcccctac accatgttct gcgagatggc 100 



30 <210> 23 

<211> 599 

<212> DNA. 

<213> partial maize ERECTA 

35 <220> 

<221> mis cofeature 

<222> (529) . . (529) 

<223> not determined 

40 

<400> 23 

tttttttttt tttttttttt tttttgagga ggaagctccg ctgctcttgc gttgcgtcac 60 
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atgacttttt acagctaaca acaccctagc 
cgtcccacaa aatgtcagtc aaactactcc 
5 cgaacgtcta cgtcttcccc tacaccatgt 
acagctcggc gtccgaggtg ctggacgagt 
ggctgacgta ctcgtcgacg taggccggcg 

10 

cgtgcgcctt ctgctgctgc tgctgcggct 
caaggacgcg caccacctcg tgcatcgtcg 
15 ggagcgccag ctggaacagc ttcttcacct 
cggggtccac cgtctccatg acctcgttgc 

20 <210> 24 

<211> 436 

<212> DNA 

<213> partial maize ERECTA 

25 <400> 24 

tttttttttt ttttttttga agaagctccg 

acagataaca ccaccctagc tactgagtcc 

30 aatgtcagcc aaactactcc ctgcaatcgg 

cgtcttcccc tacaccatgt tctgcgagat 

gtccgaggtg ctggacgagt tggcgcagga 

35 

ctcgtcgacg taggccggcg gggacggctg 
ctgctgctgc tgctgcggct gcggcggggg 
40 caccacctcg ggcatc 

<210> 25 
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tactgagtcc catgttaatc tcctgcgctg 120 

ctgcaatcgg cctcactcaa ggcgcctcac 180 

tctgcgagat ggcctcgccg aacttgagga 240 

tggcgcagga gagggcgccg gtgccccgca 300 

gcgacggctg cggcggcagc tggtggtgcg 360 

gcggcggcgg ctccgggttc accaggcagt 420 

gccggtccga gggctgccgc ttggtgcaga 4 80 

cgcccaggtc cttgcaggng tctcccacgt 540 

tcgccgtctt cgataggatc aaggatgga 599 



ctgctctcgc gttgcgtcac atgacttttt 60 

catgttaatc tcctgcgctg cgttccacaa 120 

cctcactcaa ggcgcctcac cgaacgtcta 180 

ggcctcgccg aacttgagga acagctcggc 240 

gagggcgccg gtgccccgca ggctgacgta 300 

cggcggcagc tggtggtgcg cgtgcgcctt 360 

ctccgggttc accaggcagt caaggacgcg 420 

436 
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<211> 509 
<212> DNA 

<213> partial maize E RECTA 
5 <400> 25 

ccaaagaaaa atggagaggg gggataaaga agatgaggaa gaagctccgc tgctcttgcg 
ttgcgtcaca tgacttttta cagctaacaa caccctagct actgagtccc atgttaatct 
10 cctgcgctgc gtcccacaaa atgtcagtca aactactccc tgcaatcggc ctcactcaag 
gcgcctcacc gaacgtctac gtcttcccct acaccatgtt ctgcgagatg gcctcgccga 



acttgaggaa cagctcggcg tccgaggtgc tggacgagtt ggcgcaggag agggcgccgg 

15 



tgccccgcag gctgacgtac tcgtcgacgt aggccggcgg cgacggctgc ggcggcagct 
ggtggtgcgc gtgcgccttc tgctgctgct gctgcggctg cggcggcggc tccgggttca 
20 ccaggcagtc aaggacgcgc accacctcgt gcatcgtcgg ccggtccgag ggctgccgct 
tggtgcagag gagcgccagc tggaacagc 

25 <210> 26 
<211> 318 
<212> DNA 

<213> partial maize E RECTA 
30 <400> 26 

gatggatcaa tacagcctcc tagtaagtta gaccaccaaa gaaaaatggg gaggggggat 
aaagaagagg aagaagctcc gctgctcttg cgtcacatga ctttttttac agctaacaac 
35 accctagcta ctgagtccca tgttaatctc ctgcgctgcg tcccacaaaa tgtcagtcaa 
actactcccc ctgcaatcgg cctcactcaa ggcgcctcac cgaacgtcta cgtcttcccc 



tacaccatgt tctgcgagat ggcctcgccg aacttgagga acagctcggc gtccgaggtg 

40 



60 
120 
180 
240 
300 
3 60 
420 
480 
509 



60 
120 
180 
240 
300 
318 
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<210> 27 
<211> 103 
<212> DNA 

<213> partial maize ERECTA 

5 

<400> 27 

agcaagccag tgagcaatgg tccacccaag ctggtgatcc ttcatatgaa catggctctt 60 
catgtctttg atgatataat gaggatgact gagaacttga gtg 103 

10 

<210> 28 
<211> 458 
<212> DNA 
15 <213> partial maize ERECTA 

<400> 28 

ataattcacc gggacgtaaa atcaaagaat atactcctcg acaaagatta tgaggcgcat 60 

20 cttacagact tcggcatcgc taagagctta tgtgtctcga agactcacac gtcaacctac 120 

gtcatgggca ctattggtac acttgatcct gagtacgccc gcacctcccg cctcaacgag 180 

aagtctgatg tctacagcta cggcatcgtt ctgctggagc tgctgaccgg caagaagcca 240 

25 

gtggacaacg agtgcaatct ccatcacttg atcctatcga agacggcgag ccaacgaggt 300 

catggagacg gtggaccccg acgtgggaga cacctgcaag gacctgggcg aggtgaagaa 360 

30 gctgttccag ctggcgctcc tctgcaccaa gcggcagccc tcggaccggc cgacgatgca 420 

cgaggtggtg cgcgtccttg actgcctggt gaacccgg 458 

35 <210> 29 
<211> 593 
<212> DNA 

<213> partial maize ERECTA 
40 <400> 29 

tttttttttt tttttttttt ttttttgagg aagaagctcc gctgctcttg cgttgcgtca 60 
catgactttt tacagctaac aacaccctag ctactgagtc ccatgttaat ctcctgcgct 120 
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gcgtcccaca aaatgtcagt caaactactc cctgcaatcg gcctcatttt tttgttgtcc 180 

tcaccgaacg tctacgtctt cccctaca cc atgttctgcg agatggcctc gccgaacttg 240 

5 

aggaacagct cggcgtccga ggtgctggac gagttggcgc aggaaagggc gccggtgccc 300 

cgcaggctga cgtactcgtc gacgtaggcc ggcggcgacg gctgcggcgg cagctggtgg 360 

10 tgcgcgtgcg ccttctgctg ctgctgctgc ggctgcggcg gcggctccgg gttcaccagg 420 

cagtcaagga cgcgcaccac ctcgtgcatc gtcggccggt ccgagggctg ccgcttggtg 480 

cagaggagcg ccagctggaa cagcttcttc acctcgccca ggtccttgca ggtgtctccc 540 

15 

acgtcggggt ccaccggctc catgacctcg ttgctcgccg tcttcgatag gat 593 



<210> 30 
20 <211> 206 
<212> DNA 

<213> partial maize ERECTA 
<400> 30 

25 tcacaaaaga tcatcaagca gaggaacggg agagatgatg atggatcaat acagcctcct 60 
agtaagttag accacaaaga aaaatgggga ggggggataa agaagaggaa gaagctccgc 120 
tgctcttgcg tcacatgact ttttttacag ctaacaacac cctagctact gagtcccatg 180 

30 

ttaatctcct gcgctgcgtc ccacaa 206 



<210> 31 
35 <211> 534 
<212> DNA 

<213> partial maize ERECTA 
<400> 31 

40 caagcagagg aacgggagag atgatgatgg atcaatacag cctcctagta agttagacca 60 
caaagaaaaa tggggagggg ggataaagaa gaggaagaag ctccgctgct cttgcgtcac 120 
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atgacttttt ttacagctaa caacacccta gctactgagt cccatgttaa tctcctgcgc 180 
tgcgtcccac aaaatgtcag tcaaactact ccccctgcaa tcggcctcac tcaaggcgcc 240 
5 tcaccgaacg tctacgtctt cccctacacc atgttctgcg agatggcctc gccgaacttg 300 
aggaacagct cggcgtccga ggtgctggac gagttggcgc aggagagggc gccggtgccc 360 
cgcaggctga cgtactcgtc gacgtaggcc ggcggcgacg gctgcggcgg cagctggtgg 420 

10 

tgcgcgtgcg gcttctgctg ctgctgctgc ggctgcggcg gcggctccgg gttcaccagg 480 
cagtcaagga cgcgcaccac ctcgtgcatc gtcggccggt ccgagggctg ccgc 534 

15 

<210> 32 
<211> 527 
<212> DNA 

<213> partial maize ERECTA 

20 

<400> 32 

gaaagtcaca agatcataag gaagaggaac gggagagatg atgatggatc aatacagcct 60 
cctagtaagt tagaccacca aagaaaaatg gagagggggg ataaagaaga tgaggaagaa 120 

25 

gctccgctgc tcttgcgttg cgtcacatga ctttttacag ctaacaacac cctagctact 180 
gagtcccatg ttaatctcct gcgctgcgtc ccacaaaatg tcagtcaaac tactccctgc 240 
30 aatcggcctc actcaaggcg cctcaccgaa cgtctacgtc ttcccctaca ccatgttctg 300 
cgagatggcc tcgccgaact tgaggaacag ctcggcgtcc gaggtgctgg acgagttggc 360 
gcaggaaagg gcgccggtgc cccgcaggct gacgtactcg tcgacgtagg ccggcggcga 420 

35 

cggctgcggc ggcagctggt ggtgcgcgtg cgccttctgc tgctgctgct gcggctgcgg 480 
cggcggctcc gggtt caeca ggcagtcaag gacgcgcacc acctcgt 527 

40 

<210> 33 
<211> 412 
<212> DNA 
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<213> partial maize E RECTA 
<400> 33 

cttgcgttgc gtcacatgac tttttacagc 

5 

taatctcctg cgctgcgtcc cacaaaatgt 
ctcagggggc ctcaccgaac gtctacgtct 
10 cgccgaactt gaggaacagc tcggcgtccg 
cgccggggcc ccgcaggctg acgtactcgt 
gcagctgggg gtgcgcgtgc gccttctgct 

15 

ggttcaccag gcagtcaagg acgcgcacca 



<210> 34 
20 <211> 533 
<212> DNA 

<213> partial maize E RECTA 
<400> 34 

25 tcgagttttt tttttttttt ttttgatgat 
caccaaagaa aaatggagag gggggataaa 
cgttgcgtca catgactttt tacagctaac 

30 

ctcctgcgct gcgtcccaca aaatgtcagt 
aggcgcctca ccgaacgtct acgtcttccc 
35 gaacttgagg aacagctcgg cgtccgaggt 
ggtgccccgc aggctgacgt actcgtcgac 
ctggtggtgc gcgtgcgcct tctgctgctg 

40 

caccaggcag tcaaggacgc gcaccacctc 
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taacaacacc ctagctactg agtcccatgt 60 

cagtcaaact actccctgca atcggcctca 120 

tcccctacac caggttctgc gagatggcct 180 

aggggctgga cgagttggcg caggaaaggg 240 

cgacgtaggc cggcggcgac ggctgcggcg 300 

gctgctgctg cggttgcggc ggcggctccg 360 

cctcgggcat cgtcggccgg tc 412 



ggatcaatac agcctcctag taagttagac 60 

gaagatgagg aagaagctcc gctgctcttg 120 

aacaccctag ctactgagtc ccatgttaat 180 

caaactactc cctgcaatcg gcctcactca 240 

ctacaccatg ttctgcgaga tggcctcgcc 300 

gctggacgag ttggcgcagg agagggcgcc 360 

gtaggccggc ggcgacggct gcggcggcag 420 

ctgctgcggc tgcggcggcg gctccgggtt 480 

gtgcatcgtc ggccggtccg agg 533 
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<210> 35 
<211> 191 
<212> DNA 

<213> partial maize ERECTA 

5 

<400> 35 

agcctcctag taagttagac caccaaagaa aaatggagag gggggataaa gaagatgagg 60 
aagaagctcc gctgctcttg cgttgcgtca catgactttt tacagctaaa caacacccta 120 

10 

gctactgagt cccatggtaa tctcctgcgc tgcgtcccac aaaatgtcag tcaaactact 180 
ccctgcaatc g 191 

15 

<210> 36 
<211> 683 
<212> DNA 

<213> partial maize ERECTA 

20 

<400> 36 

gacgttggga acctcctctt ttatgcttta tggagagtgg cagcttatgg gatgttttac 60 
atgaaggctc atccaagaag aacaaacttg actgggtgac tcgcctacgg atcgctcttg 120 

25 

gtgcagctca aggcctcgct taccttcacc atgactgcag cccacgaata attcaccggg 180 
acgtaaaatc aaagaatata ctcctcgaca aagattatga ggcgcatctt acagacttcg 240 
30 gcatcgctaa gagcttatgt gtctcgaaga ctcacacgtc aacctacgtc atgggcacta 300 
ttggttacat tgatcctgag tacgcccgca cctcccgcct caacgagaag tctgatgtct 360 
acagctacgg catcgttctg ctggagctgc tgaccggcaa gaagccagtg gacaacgagt 420 

35 

gcaatctcca tcacttgatc ctatcgaaga cggcgagcaa cgaggtcatg gagacggtgg 480 
accccgacgt gggagacacc tgcaaggacc tgggcgaggt gaagaagctg ttccagctgg 540 
40 cgctcctctg caccaagcgg cagccctcgg accggccgac gatgcacgag gtggtgcgcg 600 
tccttgactg cctggtgaac ccggagccgc cgccgcagcc gcagcagcag cagcagaagg 660 
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cgcacgcgca ccaccagctg ccg 



683 



10 



15 



20 



25 



30 



35 



<210> 37 
<211> 610 
<212> DNA 

<213> partial maize E RECTA 
<400> 37 

cttcggcatc gctaagagct tatgtgtctc gaagactcac acgtcaacct acgtcatggg 60 

cactattggt tacattgatc ctgagtacgc ccgcacctcc cgcctcaacg agaagtctga 120 

tgtctacagc tacggcatcg ttctgctgga gctgctgacc ggcaagaagc cagtggacaa 180 

cgagtgcaat ctccatcact tgatcctatc gaagacggcg agcaacgagg tcatggagac 240 

ggtggacccc gacgtgggag acacctgcaa ggacctgggc gaggtgaaga agctgttcca 300 

gctggcgctc ctctgcacca agcggcagcc ctcggaccgg ccgacgatgc acgaggtggt 360 

gcgcgtcctt gactgcctgg tgaacccgga gccgccgccg cagccgcagc agcagcagca 420 

gaaggcgcac gcgcaccacc agctgccgcc gcagccgtcg ccgccggcct acgtcgacga 480 

gtacgtcagc ctgcggggca ccggcgccct ctcctgcgcc aactcgtcca gcacctcgga 540 

cgccgagctg ttcctcaagt tcggcgaggc catctcgcag aacatggtgt aggggaagac 600 

gtagacgttc 610 

<210> 38 

<211> 208 

<212> DNA 

<213> partial maize ERECTA 
<220> 

<221> misc_feature 

<222> (138) . . (138) 

<223> not determined 
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<400> 38 

gcaagccagt gagcaatggt ccacccaagc tgggatcctt catatgaaca tggctcttca 60 
tgtctttgat gatataatga ggatgactga gaacttgagt gagaaataca tcattggata 120 

5 

cggggcatca agtactgntt ataaatgtgt tctaaagaat tgcaaaccag tggcaataaa 180 
aaagctgtat gcccactacc ctcagagc 208 

10 

<210> 39 
<211> 634 
<212> DNA 

<213> partial maize E RECTA 

15 

<400> 39 

gaccgggacg taaaatcaaa gaatatactc ctcgacaaag attatgaggc gcatcttaca 



60 



gacttcggca tcgctaagag cttatgtgtc tcgaagactc acacgtcaac ctacgtcatg 120 

20 

ggcactattg gttacattga tcctgagtac gcccgcacct cccgcctcaa cgagaagtct 180 

gatgtctaca gctacggcat cgttctgctg gagctgctga ccggcaagaa gccagtggac 240 

25 aacgagtgca atctccatca cttgatccta tcgaagacgg cgagcaacga ggtcatggag 300 

acggtggacc ccgacgtggg agacacctgc aaggacctgg gcgaggtgaa gaagctgttc 3 60 

cagctggcgc tcctctgcac caagcggcag ccctcggacc ggccgacgat gcacgaggtg 420 

30 

gtgcgcgtcc ttgactgcct ggtgaacccg gagccgccgc cgcagccgca gcagcagcag 480 

cagaaggcgc acgcgcacca ccagctgccg ccgcagccgt cgccgccggc ctacgtcgac 540 

35 gagtacgtca gcctgcgggg caccggcgcc ctctcctgcg ccaactcgtc cagcacctcg 600 

gacgccgagc tgttcctcaa gttcggcgag gcca 634 



40 <210> 40 

<211> 558 

<212> DNA 

<213> partial maize E RECTA 
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<400> 40 

acttgatgcc ccgtatccaa tgatgtattt 
5 atcatcaaag acatgaagag ccatgttcat 
gctcactggc ttgcttacag tggcatcttt 
agctactaag atcatcagga ggataacaag 

10 

ctttgagatt ggcggtttgt ctcggtggcc 
tccacagagt ccaggattac ctaaaaagct 
15 agtagggaca gcaccagcca aattattgta 
gttcatcaga gaagagacat cgccagttat 
gttttgcagc attccaagtt cttgaggaat 

20 

aatctccatg acacttct 

<210> 41 
25 <211> 429 
<212> DNA 

<213> partial maize E RECTA 
<400> 41 

30 tacttgatgc cccgtatcca atgatgtatt 
tatcatcaaa gacatgaaga gccatgttca 
tgctcactgg cttgcttaca gtggcatctt 

35 

cagctactaa gatcatcagg aggataacaa 
cctttgagat tggcggtttg tctcggtggc 
40 atccacagag tccaggatta cctaaaaagc 
cagtagggac agcaccagcc aaattattgt 




PCT/AU2003/000854 



ctcactcaag ttctcagtca tcctcattat 60 

atgaaggatc accagcttgg gtggaccatt 120 

aaaagcaggt ggatggtgtg gcctgcatac 180 

tccacccaca gcaacaccaa ttatggcagc 240 

agtggaacga catgaagaac caagccaata 300 

gtcatgtgaa aaccgtgtga agttgttgtc 360 

tgacacattt aagatattga ggctgaagca 420 

attgttgttt tccagtttta gcaacatcag 480 

cagaccacca agatgattat aggataaatc 540 

558 



tctcactcaa gttctcagtc atcctcatta 60 

tatgaaggat caccagcttg ggtggaccat 120 

taaaagcagg tggatggtgt ggcctgcata 180 

gtccacccac agcaacacca attatggcag 240 

cagtggaacg acatgaagaa ccaagccaat 300 

tgtcatgtga aaaccgtgtg aagttgttgt 360 

atgacacatt taagatattg aggctgaagc 420 
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agttcatca 



429 



10 



15 



20 



25 



30 



35 



<210> 42 
<211> 556 
<212> DNA 

<213> partial maize ERECTA 
<400> 42 

acatgcaagt caacaggtta actggatcga taccaccaga gctaggaaat atgtcaacac 60 

ttcattacct agaactgaat gataatcaac ttactgggtc aattccacca gagcttggaa 120 

ggctaacagg cttgtttgac ctgaaccttg cgaataacca ccttgaagga ccaattcctg 180 

acaacctaag ttcatgtgtg aatctcaata gcttcaatgc ttatggcaac aagttaaatg 240 

gaaccattcc tcgttcgctg cggaaacttg aaagcatgac ctatttaaat ctttcatcaa 300 

atttcataag tggctctatt cctattgagc tatcaaggat caacaatttg gacacgttgg 360 

acttatcctg taacatgatg acgggtccaa ttccatcatc cattggcaac ctagagcatc 420 

tattgaggct taacttgagc aagaatgatc tagttggatt catccctgcg gagtttggta 480 

atttgagaag tgtcatggag attgatttat cctataatca tcttggtggt ctgattcctc 540 

aagaacttgg aatgct 556 

<210> 43 
<211> 683 
<212> DNA 

<213> partial maize ERECTA 
<400> 43 

gacgttggga acctcctctt ttatgcttta tggagagtgg cagcttatgg gatgttttac 60 
atgaaggctc atccaagaag aacaaacttg actgggtgac tcgcctacgg atcgctcttg 120 
gtgcagctca aggcctcgct taccttcacc atgactgcag cccacgaata attcaccggg 180 
acgtaaaatc aaagaatata ctcctcgaca aagattatga ggcgcatctt acagacttcg 240 
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gcatcgctaa gagcttatgt gtctcgaaga ctcacacgtc aacctacgtc atgggcacta 300 

ttggttacat tgatcctgag tacgcccgca cctcccgcct caacgagaag tctgatgtct 360 

5 

acagctacgg catcgttctg ctggagctgc tgaccggcaa gaagccagtg gacaacgagt 420 

gcaatctcca tcacttgatc ctatcgaaga cggcgagcaa cgaggtcatg gagacggtgg 480 

10 accccgacgt gggagacacc tgcaaggacc tgggcgaggt gaagaagctg ttccagctgg 540 

cgctcctctg caccaagcgg cagccctcgg accggccgac gatgcacgag gtggtgcgcg 600 

tccttgactg cctggtgaac ccggagccgc cgccgcagcc .gcagcagcag cagcagaagg 660 

15 

cgcacgcgca ccaccagctg ccg 683 

<210> 44 

20 <211> 2315 

<212> DNA 

<213> maize E RECTA 

<400> 44 

25 acatgcaagt caacaggtta actggatcga taccaccaga gctaggaaat atgtcaacac 60 

ttcattacct agaactgaat gataatcaac ttactgggtc aattccacca gagcttggaa 120 

ggctaacagg cttgtttgac ctgaaccttg cgaataacca ccttgaagga ccaattcctg 180 

30 

acaacctaag ttcatgtgtg aatctcaata gcttcaatgc ttatggcaac aagttaaatg 240 

gaaccattcc tcgttcgctg cggaaacttg aaagcatgac ctatttaaat ctttcatcaa 300 

35 atttcataag tggctctatt cctattgagc tatcaaggat caacaatttg gacacgttgg 360 

acttatcctg taacatgatg acgggtccaa ttccatcatc cattggcaac ctagagcatc 420 

tattgaggct taacttgagc aagaatgatc tagttggatt catccctgcg gagtttggta 480 

40 

atttgagaag tgtcatggag attgatttat cctataatca tcttggtggt ctgattcctc 540 

aagaacttgg aatgctgcaa aacctgatgt tgctaaaact ggaaaacaac aatataactg 600 
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gcgatgtctc ttctctgatg aactgcttca 
atttggctgg tgctgtccct actgacaaca 

5 

taggtaatcc tggactctgt ggatattggc 
gagacaaacc gccaatctca aaggctgcca 
10 tcctcctgat gatcttagta gctgtatgca 
ccactgtaag caagccagtg agcaatggtc 
tggctcttca tgtctttgat gatataatga 

15 

tcattggata cggggcatca agtactgttt 
tggcaataaa aaagctgtat gcccactacc 
20 tcgagactgt tggtagcatc aaacaccgga 
cacctgttgg gaacctcctc ttttatgctt 
tacatgaagg ctcatccaag aagaacaaac 

25 

ttggtgcagc tcaaggcctc gcttaccttc 
gggacgtaaa atcaaagaat atactcctcg 
30 tcggcatcgc taagagctta tgtgtctcga 
ctattggtta cattgatcct gagtacgccc 
tctacagcta cggcatcgtt ctgctggagc 

35 

agtgcaatct ccatcacttg atcctatcga 
tggaccccga cgtgggagac acctgcaagg 
40 tggcgctcct ctgcaccaag cggcagccct 
gcgtccttga ctgcctggtg aacccggagc 
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gcctcaatat cttaaatgtg tcatacaata 660 

acttcacacg gttttcacat gacagctttt 720 

ttggttcttc atgtcgttcc actggccacc 780 

taattggtgt tgctgtgggt ggacttgtta 840 

ggccacacca tccacctgct tttaaagatg 900 

cacccaagct ggtgatcctt catatgaaca 960 

ggatgactga gaacttgagt gagaaataca 1020 

ataaatgtgt tctaaagaat tgcaaaccag 1080 

tgcagagcct taaggaattt gaaactgagc 1140 

atctagtcag cctgcaaggg tactcgttgt 1200 

atatggagag tggcagctta tgggatgttt 1260 

ttgactgggt gactcgccta cggatcgctc 1320 

accatgactg cagcccacga ataattcacc 1380 

acaaagatta tgaggcgcat cttacagact 1440 

agactcacac gtcaacctac gtcatgggca 1500 

gcacctcccg cctcaacgag aagtctgatg 1560 

tgctgaccgg caagaagcca gtggacaacg 1620 

agacggcgag caacgaggtc atggagacgg 1680 

acctgggcga ggtgaagaag ctgttccagc 1740 

cggaccggcc gacgatgcac gaggtggtgc 1800 

cgccgccgca gccgcagcag cagcagcaga 1860 
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aggcgcacgc gcaccaccag ctgccgccgc agccgtcgcc gccggcctac gtcgacgagt 1920 

acgtcagcct gcggggcacc ggcgccctct cctgcgccaa ctcgtccagc acctcggacg 1980 

5 ccgagctgtt cctcaagttc ggcgaggcca tctcgcagaa catggtgtag gggaagacgt 2040 

agacgttcgg tgaggcgcct tgagtgaggc cgattgcagg gagtagtttg actgacattt 2100 



10 



tgtgggacgc agcgcaggag attaacatgg gactcagtag ctagggtgtt gttagctgta 2160 
aaaagtcatg tgacgcaacg caagagcagc ggagcttctt cctcatcttc tttatccccc 2220 



ctctccattt ttctttggtg gtctaactta ctaggaggct gtattgatcc atcatcatct 2280 



15 ctcccgttcc tcttccttat gatcttgtga ctttc 



2315 



<210> 45 

<211> 675 

20 <212> PRT 

<213> maize ERECT A 

<400> 45 

25 Met Gin Val Asn Arg Leu Thr Gly Ser lie Pro Pro Glu Leu Gly Asn 
15 10 15 



Met Ser Thr Leu His Tyr Leu Glu Leu Asn Asp Asn Gin Leu Thr Gly 
30 20 25 30 



Ser He Pro Pro Glu Leu Gly Arg Leu Thr Gly Leu Phe Asp Leu Asn 
35 40 45 

35 

Leu Ala Asn Asn His Leu Glu Gly Pro He Pro Asp Asn Leu Ser Ser 
50 55 60 



40 



Cys Val Asn Leu Asn Ser Phe Asn Ala Tyr Gly Asn Lys Leu Asn Gly 
65 70 75 80 
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Thr lie Pro Arg Ser Leu Arg Lys Leu Glu Ser Met Thr Tyr Leu Asn 
85 90 95 

5 

Leu Ser Ser Asn Phe lie Ser Gly Ser lie Pro lie Glu Leu Ser Arg 
100 105 110 



10 lie Asn Asn Leu Asp Thr Leu Asp Leu Ser Cys Asn Met Met Thr Gly 
115 120 125 



Pro lie Pro Ser Ser He Gly Asn Leu Glu His Leu Leu Arg Leu Asn 
15 130 135 140 



Leu Ser Lys Asn Asp Leu Val Gly Phe. He Pro Ala Glu Phe Gly Asn 
145 150 155 160 

20 

Leu Arg Ser Val Met Glu He Asp Leu Ser Tyr Asn His Leu Gly Gly 
165 170 175 

25 

Leu He Pro Gin Glu Leu Gly Met Leu Gin Asn Leu Met Leu Leu Lys 
180 185 190 



30 Leu Glu Asn Asn Asn He Thr Gly Asp Val Ser Ser Leu Met Asn Cys 
195 200 205 



Phe Ser Leu Asn He Leu Asn Val Ser Tyr Asn Asn Leu Ala Gly Ala 
35 210 215 220 



Val Pro Thr Asp Asn Asn Phe Thr Arg Phe Ser His Asp Ser Phe Leu 
225 230 235 240 

40 



Gly Asn Pro Gly Leu Cys Gly Tyr Trp Leu Gly Ser Ser Cys Arg Ser 
245 250 255 
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Thr Gly His Arg Asp Lys Pro Pro lie Ser Lys Ala Ala lie He Gly 
260 265 270 

5 

Val Ala Val Gly Gly Leu Val He Leu Leu Met He Leu Val Ala Val 
275 280 285 

10 

Cys Arg Pro His His Pro Pro Ala Phe Lys Asp Ala Thr Val Ser Lys 
290 295 300 



15 Pro Val Ser Asn Gly Pro Pro Lys Leu Val He Leu His Met Asn Met 
305 310 315 320 



Ala Leu His Val Phe Asp Asp He Met Arg Met Thr Glu Asn Leu Ser 
20 325 330 335 



Glu Lys Tyr He He Gly Tyr Gly Ala Ser Ser Thr Val Tyr Lys Cys 
340 345 350 

25 

Val Leu Lys Asn Cys Lys Pro Val Ala He Lys Lys Leu Tyr Ala His 
355 360 365 

30 

Tyr Leu Gin Ser Leu Lys Glu Phe Glu Thr Glu Leu Glu Thr Val Gly 
370 375 380 



35 Ser He Lys His Arg Asn Leu Val Ser Leu Gin Gly Tyr Ser Leu Ser 
385 390 395 400 



Pro Val Gly Asn Leu Leu Phe Tyr Ala Tyr Met Glu Ser Gly Ser Leu 
40 405 410 415 



Trp Asp Val Leu His Glu Gly Ser Ser Lys Lys Asn Lys Leu Asp Trp 
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420 425 430 



Val Thr Arg Leu Arg lie Ala Leu Gly Ala Ala Gin Gly Leu Ala Tyr 
5 435 440 445 



Leu His His Asp Cys Ser Pro Arg lie He His Arg Asp Val Lys Ser 
450 455 460 

10 

Lys Asn He Leu Leu Asp Lys Asp Tyr Glu Ala His Leu Thr Asp Phe 
465 470 475 480 

15 

Gly He Ala Lys Ser Leu Cys Val Ser Lys Thr His Thr Ser Thr Tyr 
485 490 495 



20 Val Met Gly Thr He Gly Tyr He Asp Pro Glu Tyr Ala Arg Thr Ser 
500 ' 505 510 



Arg Leu Asn Glu Lys Ser Asp Val Tyr Ser Tyr Gly He Val Leu Leu 
25 515 520 525 



Glu Leu Leu Thr Gly Lys Lys Pro Val Asp Asn Glu Cys Asn Leu His 
530 535 540 

30 

His Leu He Leu Ser Lys Thr Ala Ser Asn Glu Val Met Glu Thr Val 
545 550 555 560 

35 

Asp Pro Asp Val Gly Asp Thr Cys Lys Asp Leu Gly Glu Val Lys Lys 
565 570 575 



40 Leu Phe Gin Leu Ala Leu Leu Cys Thr Lys Arg Gin Pro Ser Asp Arg 
580 585 590 
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Pro Thr Met His Glu Val Val Arg Val Leu Asp Cys Leu Val Asn Pro 
595 600 605 



5 Glu Pro Pro Pro Gin Pro Gin Gin Gin Gin Gin Lys Ala His Ala His 
^10 615 620 



His Gin Leu Pro Pro Gin Pro Ser Pro Pro Ala Tyr Val Asp Glu Tyr 
10 625 630 635 640 



Val Ser Leu Arg Gly Thr Gly Ala Leu Ser Cys Ala Asn Ser Ser Ser 
645 650 655 

15 

Thr Ser Asp Ala Glu Leu Phe Leu Lys Phe Gly Glu Ala He Ser Gin 
660 665 670 



20 

Asn Met Val 
675 
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